Commit 7883aa6

update the stage of run.sh and synthesize_e2e.sh, to be clear (#4057)
* run.sh: add a `--stage` parameter to `synthesize` and `synthesize_e2e` to control vocoder model selection; README.md: document the `stage` parameter and clarify the vocoder selection logic
* add comments for the `stage` parameter in run.sh
* change HiFiGAN to Multi-Band MelGAN
* revert the cmsc file to its original place (No.15 unchanged); only No.6 is modified here
* update the stage of run.sh and synthesize_e2e.sh, to be clear
* fix the md
1 parent a9d9d5f commit 7883aa6

File tree

1 file changed: +2 −2 lines changed

examples/aishell3/ernie_sat/README.md

Lines changed: 2 additions & 2 deletions
@@ -13,7 +13,7 @@ In ERNIE-SAT, we propose two innovations:
 ## Dataset
 ### Download and Extract
 Download AISHELL-3 from it's [Official Website](http://www.aishelltech.com/aishell_3) and extract it to `~/datasets`. Then the dataset is in the directory `~/datasets/data_aishell3`.
-
+
 ### Get MFA Result and Extract
 We use [MFA2.x](https://github.com/MontrealCorpusTools/Montreal-Forced-Aligner) to get durations for aishell3_fastspeech2.
 You can download from here [aishell3_alignment_tone.tar.gz](https://paddlespeech.cdn.bcebos.com/MFA/AISHELL-3/with_tone/aishell3_alignment_tone.tar.gz), or train your MFA model reference to [mfa example](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples/other/mfa) (use MFA1.x now) of our repo.
@@ -138,7 +138,7 @@ You can check the text of downloaded wavs in `source/README.md`.
 ```bash
 ./run.sh --stage 3 --stop-stage 3 --gpus 0
 ```
-`stage 3` of `run.sh` calls `local/synthesize_e2e.sh`, `stage 0` of it is **Speech Synthesis** and `stage 1` of it is **Speech Editing**.
+`stage 3` of `run.sh` calls `local/synthesize_e2e.sh`. `synthesize_e2e.sh` is a script for performing both **Speech Synthesis** and **Speech Editing** tasks by default. It converts input text into speech for synthesis and modifies existing speech based on new text content for editing.

 You can modify `--wav_path``--old_str` and `--new_str` yourself, `--old_str` should be the text corresponding to the audio of `--wav_path`, `--new_str` should be designed according to `--task_name`, both `--source_lang` and `--target_lang` should be `zh` for model trained with AISHELL3 dataset.
 ## Pretrained Model
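The `./run.sh --stage 3 --stop-stage 3` invocation shown in the diff relies on the Kaldi-style stage gating used throughout PaddleSpeech example scripts. Below is a minimal sketch of that pattern, not the actual `run.sh`; the `run_stage` helper is hypothetical (real scripts inline the `if` checks), and the stage variables are normally set from `--stage`/`--stop-stage` by an option parser.

```shell
#!/usr/bin/env bash
# Sketch of Kaldi-style stage gating, as commonly used in run.sh scripts.
stage=3       # normally set via --stage
stop_stage=3  # normally set via --stop-stage

run_stage() {
    local n=$1; shift
    # A stage runs only when stage <= n <= stop_stage.
    if [ "${stage}" -le "${n}" ] && [ "${stop_stage}" -ge "${n}" ]; then
        echo "running stage ${n}: $*"
    fi
}

run_stage 0 "preprocess data"
run_stage 1 "train the model"
run_stage 2 "synthesize"
run_stage 3 "synthesize_e2e (speech synthesis and speech editing)"
```

With `stage=3` and `stop_stage=3`, only the last call fires, which is why `--stage 3 --stop-stage 3` runs only the `synthesize_e2e` step.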
