NewComer00
diff --git a/‎.github/workflows/packaging.yml‎
Lines changed: 4 additions & 0 deletions b/‎.github/workflows/packaging.yml‎
Lines changed: 4 additions & 0 deletions
diff --git a/‎README.en.md‎
Lines changed: 23 additions & 16 deletions b/‎README.en.md‎
Lines changed: 23 additions & 16 deletions
diff --git a/‎README.md‎
Lines changed: 16 additions & 12 deletions b/‎README.md‎
Lines changed: 16 additions & 12 deletions
diff --git a/‎examples/Прекрасное Далеко/expressive_config.json‎
Lines changed: 4 additions & 4 deletions b/‎examples/Прекрасное Далеко/expressive_config.json‎
Lines changed: 4 additions & 4 deletions
diff --git a/‎examples/テトリス/expressive_config.json‎
Lines changed: 4 additions & 4 deletions b/‎examples/テトリス/expressive_config.json‎
Lines changed: 4 additions & 4 deletions
diff --git a/‎examples/明天会更好/expressive_config.json‎
Lines changed: 4 additions & 4 deletions b/‎examples/明天会更好/expressive_config.json‎
Lines changed: 4 additions & 4 deletions
diff --git a/‎expressions/pitd.py‎
Lines changed: 8 additions & 7 deletions b/‎expressions/pitd.py‎
Lines changed: 8 additions & 7 deletions
@@ -53,6 +53,10 @@ jobs:
           pip install -e ".[${{ matrix.extras }}]"
           pip install pyinstaller
 
+      - name: Download rmvpe-onnx model
+        run: |
+          rmvpe-onnx download
+
       - name: Build executable with PyInstaller
         run: |
           pyinstaller --noconfirm build/expressive.spec
 
@@ -21,7 +21,7 @@ The current version supports importing the following expression parameters:
 
 | **Working with OpenUtau** | **Data Viewer** |
 |:---:|:---:|
-| <img src="https://github.com/user-attachments/assets/268b44d4-528d-481e-acfb-3f7da7261c80" width="100%" /> | <img src="https://github.com/user-attachments/assets/ef97aa6a-5938-42f1-bd4a-78f268109db8" width="100%" /> |
+| <img src="https://github.com/user-attachments/assets/268b44d4-528d-481e-acfb-3f7da7261c80" width="100%" /> | <img src="https://github.com/user-attachments/assets/91ddadee-62cd-4420-abf0-dd9177e8f935" width="100%" /> |
 
 </div>
 
@@ -43,9 +43,9 @@ The current version supports importing the following expression parameters:
 * OpenUtau Beta (or other versions with DiffSinger support)
 * Python 3.10 \*
 
-By default, this application uses [swift-f0](https://github.com/lars76/swift-f0) (based on ONNX Runtime) as the pitch extraction backend, which runs on CPU only and satisfies basic usage scenarios.
+By default, this application uses [rmvpe-onnx](https://github.com/newcomer00/rmvpe-onnx) as the pitch extraction backend, which runs on CPU only. [RMVPE](https://arxiv.org/abs/2306.15412v2) is currently the best-performing publicly available pitch extraction algorithm, and its inference speed is fast enough to satisfy the vast majority of use cases.
 
-The classic [CREPE](https://github.com/marl/crepe) pitch extraction backend (depends on TensorFlow) is also available for scenarios with higher accuracy requirements. If your computer is equipped with an NVIDIA GPU and supports [CUDA 11.x](https://docs.nvidia.com/deploy/cuda-compatibility/minor-version-compatibility.html) (i.e., GPU driver version >= 450), the CREPE backend will automatically enable GPU acceleration.
+The [swift-f0](https://github.com/lars76/swift-f0) and [CREPE](https://github.com/marl/crepe) pitch extraction backends are also available. The former runs on CPU only and is the fastest option, though its accuracy is modest. The latter is a classic algorithm in the field and runs more slowly. In a CUDA environment, the CREPE backend will automatically enable GPU acceleration.
 
 > \* On Windows, TensorFlow 2.10 is the last version that supports GPU acceleration, and Python 3.10 is the highest Python version supported by its `.whl` files.
 
@@ -79,6 +79,7 @@ A new USTX file with expression parameters added. The original project will not
 * [x] Linux support
 * [x] NVIDIA GPU acceleration
 * [x] Parameter config import/export
+* [x] Expression curve visualization
 * [x] `Pitch Deviation` generation
 * [x] `Dynamics` generation
 * [x] `Tension` generation
@@ -87,15 +88,15 @@ A new USTX file with expression parameters added. The original project will not
 
 You can download pre-compiled executable files directly from the [Releases](https://github.com/NewComer00/expressive/releases) page:
 
-### `Expressive-GUI-<version>-Windows-x64-CPU.exe`
+### `Expressive-<version>-Windows-x64-CPU.exe`
 
-GUI installer for Windows x64 architecture.
+Expressive CLI / GUI / Viewer installer for Windows x64 architecture.
 
 CPU-only, no CUDA runtime libraries included. Small installation size, but slower when using the CREPE backend for pitch extraction.
 
-### `Expressive-GUI-<version>-Windows-x64-GPU.exe`
+### `Expressive-<version>-Windows-x64-GPU.exe`
 
-GUI installer for Windows x64 architecture with GPU support.
+Expressive CLI / GUI / Viewer installer for Windows x64 architecture with GPU support.
 
 Includes CUDA runtime libraries. When used on a computer with an NVIDIA GPU (driver version >= 450), it significantly improves CREPE backend inference speed.
 
@@ -127,6 +128,8 @@ pip install -e ".[gpu,gui]"
 
 After installation, you can use the `expressive` and `expressive-gui` entry points to run the **command-line interface** and **graphical user interface**.
 
+You can also launch a standalone expression curve visualization tool via the `expressive-viewer` command to view and analyze expression curves extracted by `expressive` and `expressive-gui` in real time.
+
 ## 📖 Usage
 
 > [!TIP]
@@ -143,6 +146,16 @@ After installation, you can use the `expressive` and `expressive-gui` entry poin
 > LANGUAGE="en_US" expressive-gui
 > ```
 
+> [!IMPORTANT]
+> For users who installed from source, when using the [rmvpe-onnx](https://github.com/newcomer00/rmvpe-onnx) backend, the application will automatically download the model file [rmvpe.onnx (Copyright (c) 2022 lj1995 — MIT License)](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.onnx) from Hugging Face.
+>
+> If you wish to download the model file in advance, you can run the following command after installation:
+> ```bash
+> rmvpe-onnx download
+> ```
+>
+> If you installed the application via the installer, the model file is already included in the installation package, and no additional download is required.
+
 ### Command Line Interface (CLI)
 
 Display help:
@@ -212,7 +225,7 @@ You can inspect the details of the expression curves in `expressive-viewer`, ana
 
 The [`examples/` directory](examples/) contains several sample projects. You can import the `expressive_config.json` file from any example into the GUI to automatically populate all parameters with the preset values.
 
-If you installed the application from the installer, a shortcut named `Expressive-examples` pointing to the examples directory will appear on your desktop after installation — you can import the config files directly from there.
+If you installed the application from the installer, a shortcut named `Expressive Examples` pointing to the examples directory will appear on your desktop after installation — you can import the config files directly from there.
 
 ## 🔬 Algorithm Workflow
 ```mermaid
@@ -293,10 +306,7 @@ The extracted PITD expression curve is too flat, with almost no significant vari
 The two confidence thresholds in the PITD extractor are set **too high**, causing many pitch changes to be discarded.
 
 #### Solution
-Try lowering both confidence thresholds. In general, the **Utau vocal** is relatively clean, so it is advisable to first adjust the confidence threshold for the **Reference vocal**.
-
-#### Future Plan
-Introduce a better PITD backend (e.g., [RMVPE](https://github.com/Dream-High/RMVPE)).
+First try using the best-performing rmvpe-onnx backend (with default confidence thresholds). If the issue persists, try lowering both confidence thresholds. In general, the **Utau vocal** is relatively clean, so it is advisable to first adjust the confidence threshold for the **Reference vocal**.
 
 ### PITD expression curve has sudden jumps or spikes at certain positions
 
@@ -307,7 +317,4 @@ The PITD expression curve changes too rapidly at certain positions, with very la
 The two confidence thresholds in the PITD extractor are set **too low**, causing erroneous detection results to be accepted.
 
 #### Solution
-Try increasing both confidence thresholds. In general, the **Utau vocal** is relatively clean, so it is advisable to first adjust the confidence threshold for the **Reference vocal**.
-
-#### Future Plan
-Introduce a better PITD backend (e.g., [RMVPE](https://github.com/Dream-High/RMVPE)).
+First try using the best-performing rmvpe-onnx backend (with default confidence thresholds). If the issue persists, try increasing both confidence thresholds. In general, the **Utau vocal** is relatively clean, so it is advisable to first adjust the confidence threshold for the **Reference vocal**.
@@ -21,7 +21,7 @@
 
 | **工作流程** | **数据可视化** |
 |:---:|:---:|
-| <img src="https://github.com/user-attachments/assets/268b44d4-528d-481e-acfb-3f7da7261c80" width="100%" /> | <img src="https://github.com/user-attachments/assets/ef97aa6a-5938-42f1-bd4a-78f268109db8" width="100%" /> |
+| <img src="https://github.com/user-attachments/assets/268b44d4-528d-481e-acfb-3f7da7261c80" width="100%" /> | <img src="https://github.com/user-attachments/assets/91ddadee-62cd-4420-abf0-dd9177e8f935" width="100%" /> |
 
 </div>
 
@@ -43,9 +43,9 @@
 * OpenUtau Beta（或支持 DiffSinger 的其他版本）
 * Python 3.10 \*
 
-本应用默认选择 [swift-f0](https://github.com/lars76/swift-f0)（基于 ONNX Runtime）作为音高提取后端，仅需 CPU 即可运行，可满足基础使用场景。
+本应用默认选择 [rmvpe-onnx](https://github.com/newcomer00/rmvpe-onnx) 作为音高提取后端，仅需 CPU 即可运行。[RMVPE](https://arxiv.org/abs/2306.15412v2) 是目前公开的效果最好的音高提取算法，且推理速度较快，可以满足绝大多数使用场景。
 
-也提供了经典的 [CREPE](https://github.com/marl/crepe)（依赖 TensorFlow）音高提取后端，适合更高要求的使用场景。若您的电脑配有 NVIDIA 显卡且支持 [CUDA 11.x](https://docs.nvidia.com/deploy/cuda-compatibility/minor-version-compatibility.html)（即显卡驱动版本 >= 450），使用 CREPE 后端时会自动启用 GPU 加速。
+应用也提供了 [swift-f0](https://github.com/lars76/swift-f0) 与 [CREPE](https://github.com/marl/crepe) 音高提取后端。前者仅依赖 CPU，效果一般，但速度最快。后者是业内的经典算法，速度较慢。在 CUDA 环境下，CREPE 后端会自动启用 GPU 加速。
 
 > \* 在 Windows 平台下，TensorFlow 2.10 是最后一个支持 GPU 加速的版本，Python 3.10 是它的 `.whl` 文件支持的最高 Python 版本。
 
@@ -146,6 +146,16 @@ pip install -e ".[gpu,gui]"
 > LANGUAGE="en_US" expressive-gui
 > ```
 
+> [!IMPORTANT]
+> 从源码安装的用户在运行 [rmvpe-onnx](https://github.com/newcomer00/rmvpe-onnx) 后端时，应用会自动从 Hugging Face 下载模型文件 [rmvpe.onnx（Copyright (c) 2022 lj1995 — MIT License）](https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.onnx)。
+>
+> 如果您希望提前下载模型文件，可在安装完成后运行以下命令：
+> ```bash
+> rmvpe-onnx download
+> ```
+>
+> 若您是通过安装包获取的本应用，安装包中已包含该模型文件，无需额外下载。
+
 ### 命令行界面（CLI）
 
 显示帮助信息
@@ -220,7 +230,7 @@ expressive-viewer
 
 项目的 [`examples/` 目录](examples/)下存放有多个示例。您可以在图形用户界面中导入相应示例的 `expressive_config.json` 配置文件，将预设的参数一键填写到应用中。
 
-若您是从安装包获取的本应用，安装完毕后示例目录的快捷方式 `Expressive-examples` 将出现在您的桌面，您也可以直接导入其中的配置文件。
+若您是从安装包获取的本应用，安装完毕后示例目录的快捷方式 `Expressive Examples` 将出现在您的桌面，您也可以直接导入其中的配置文件。
 
 ## 🔬 算法流程
 ```mermaid
@@ -301,10 +311,7 @@ NiceGUI 框架已经开始着手改进文件拖拽支持，应该在未来的版
 PITD 表情提取器中，两个置信度阈值设置**过高**，许多音高变化没有被采信。
 
 #### 解决方案
-尝试降低两个置信度阈值。一般来说，**歌姬音声**比较纯净，可以先调整**参考人声**的置信度阈值。
-
-#### 未来计划
-引入更好的 PITD 后端（如 [RMVPE](https://github.com/Dream-High/RMVPE)）。
+请先尝试使用效果最好的 rmvpe-onnx 后端（默认置信度阈值）。若问题仍在，尝试降低两个置信度阈值。一般来说，**歌姬音声**比较纯净，可以先调整**参考人声**的置信度阈值。
 
 ### PITD 表情曲线在某些位置变化过快，出现跳跃或毛刺
 
@@ -315,7 +322,4 @@ PITD 表情曲线在某些位置变化过快，出现非常大的跳跃或毛刺
 PITD 表情提取器中，两个置信度阈值设置**过低**，错误的识别结果被采信。
 
 #### 解决方案
-尝试增加两个置信度阈值。一般来说，**歌姬音声**比较纯净，可以先调整**参考人声**的置信度阈值。
-
-#### 未来计划
-引入更好的 PITD 后端（如 [RMVPE](https://github.com/Dream-High/RMVPE)）。
+请先尝试使用效果最好的 rmvpe-onnx 后端（默认置信度阈值）。若问题仍在，尝试增加两个置信度阈值。一般来说，**歌姬音声**比较纯净，可以先调整**参考人声**的置信度阈值。
@@ -18,9 +18,9 @@
         },
         "pitd": {
             "selected": true,
-            "backend": "crepe",
-            "confidence_utau": 0.8,
-            "confidence_ref": 0.6,
+            "backend": "rmvpe-onnx",
+            "confidence_utau": null,
+            "confidence_ref": null,
             "align_radius": 1,
             "semitone_shift": 0,
             "smoothness": 4,
@@ -35,4 +35,4 @@
             "bias": 10
         }
     }
-}
+}
@@ -18,9 +18,9 @@
         },
         "pitd": {
             "selected": true,
-            "backend": "swift-f0",
-            "confidence_utau": 0.85,
-            "confidence_ref": 0.9,
+            "backend": "rmvpe-onnx",
+            "confidence_utau": null,
+            "confidence_ref": null,
             "align_radius": 1,
             "semitone_shift": 0,
             "smoothness": 2,
@@ -35,4 +35,4 @@
             "bias": 10
         }
     }
-}
+}
@@ -18,9 +18,9 @@
         },
         "pitd": {
             "selected": true,
-            "backend": "swift-f0",
-            "confidence_utau": 0.9,
-            "confidence_ref": 0.93,
+            "backend": "rmvpe-onnx",
+            "confidence_utau": null,
+            "confidence_ref": null,
             "align_radius": 1,
             "semitone_shift": 0,
             "smoothness": 2,
@@ -35,4 +35,4 @@
             "bias": 10
         }
     }
-}
+}
@@ -26,13 +26,14 @@ class PitdLoader(ExpressionLoader):
     expression_name = "pitd"
     expression_info = _l("Pitch Deviation (curve)")
     backend_choices = {
-        "swift-f0": _l("fast, CPU-based (ONNX Runtime)"),
-        "crepe": _l("classic but slow, CPU & NVIDIA GPU (TensorFlow)"),
+        "rmvpe-onnx": _l("finest accuracy, fast, CPU only (ONNX Runtime)"),
+        "swift-f0": _l("fair accuracy, fastest, CPU only (ONNX Runtime)"),
+        "crepe": _l("good accuracy, slow, CPU & NVIDIA GPU (TensorFlow)"),
     }
-    confidence_utau_recommended = {"swift-f0": 0.95, "crepe": 0.8}
-    confidence_ref_recommended  = {"swift-f0": 0.93, "crepe": 0.6}
+    confidence_utau_recommended = {"rmvpe-onnx": 0.03, "swift-f0": 0.95, "crepe": 0.80}
+    confidence_ref_recommended  = {"rmvpe-onnx": 0.03, "swift-f0": 0.93, "crepe": 0.60}
     args = SimpleNamespace(
-        backend         = Args(name="backend"        , type=str  , default="swift-f0", choices=list(backend_choices.keys()), help=_lf("**F0 detection backend** for extracting pitch from WAV files. Available options:\n\n%s\n\n", lambda: "\n".join([f"- `{k}`: {v}" for k, v in PitdLoader.backend_choices.items()]))),  # noqa: E501
+        backend         = Args(name="backend"        , type=str  , default="rmvpe-onnx", choices=list(backend_choices.keys()), help=_lf("**F0 detection backend** for extracting pitch from WAV files. Available options:\n\n%s\n\n", lambda: "\n".join([f"- `{k}`: {v}" for k, v in PitdLoader.backend_choices.items()]))),  # noqa: E501
         confidence_utau = Args(name="confidence_utau", type=float, default=None, help=_lf("Minimum **confidence level** for keeping detected pitch values in the **UTAU** WAV. Lower values retain more frames but may include errors. Omit to use the recommended value for the selected backend:\n\n%s\n\n", lambda: "\n".join([f"- `{k}`: {v}" for k, v in PitdLoader.confidence_utau_recommended.items()]))),  # noqa: E501
         confidence_ref  = Args(name="confidence_ref" , type=float, default=None, help=_lf("Minimum **confidence level** for keeping detected pitch values in the **reference** WAV. Lower values retain more frames but may include errors. Omit to use the recommended value for the selected backend:\n\n%s\n\n", lambda: "\n".join([f"- `{k}`: {v}" for k, v in PitdLoader.confidence_ref_recommended.items()]))),  # noqa: E501
         align_radius    = Args(name="align_radius"   , type=int  , default=1   , help=_l("**Radius** for the FastDTW alignment algorithm; larger values allow more flexible alignment but increase computation time")),  # noqa: E501
@@ -114,12 +115,12 @@ def get_expression(
         return self.expression_tick, self.expression_val
 
 
-def get_wav_features(wav_path, backend="swift-f0", confidence_threshold=0.8, confidence_filter_size=9):
+def get_wav_features(wav_path, backend="rmvpe-onnx", confidence_threshold=0.8, confidence_filter_size=9):
     """Extract features from a WAV file.
 
     Args:
         wav_path (str): Path to the WAV file.
-        backend (str, optional): F0 detection backend ("crepe" or "swift-f0"). Defaults to "swift-f0".
+        backend (str, optional): F0 detection backend ("crepe" or "swift-f0" or "rmvpe-onnx"). Defaults to "rmvpe-onnx".
         confidence_threshold (float, optional): Confidence threshold for pitch detection. Defaults to 0.8.
         confidence_filter_size (int, optional): Size of the median filter for confidence. Defaults to 9.