README.md (+6 -3)
@@ -41,7 +41,8 @@ Since version 2.0, Wingman AI Core acts as a "backend" API (using FastAPI and Py
 - OpenAI Whisper
 - Azure Whisper
 - Azure Speech
-- whispercpp (local, bundled with Wingman AI)
+- whispercpp (local, needs to be installed separately)
+- FasterWhisper (local, default)
 - Wingman Pro (Azure Speech or Azure Whisper)
 - **Text-to-speech** (TTS) providers:
 - OpenAI TTS
@@ -138,9 +139,9 @@ You can use any LLM offering an OpenAI-compatible API and connect it to Wingman
 ### Windows
 
 - Download the installer of the latest version from [wingman-ai.com](https://www.wingman-ai.com).
+- If you have an NVIDIA RTX GPU, install the latest [CUDA driver](https://developer.nvidia.com/cuda-downloads) from NVIDIA to speed up the transcription process significantly.
 - Install it to a directory of your choice and start the client `Wingman AI.exe`.
 - The client will will auto-start `Wingman AI Core.exe` in the background
-- The client will auto-start `whispercpp` in the background. If you have an NVIDIA RTX GPU, install the latest CUDA driver from NVIDIA and enable GPU acceleration in the Settings view.
 
 If that doesn't work for some reason, try starting `Wingman AI Core.exe` manually and check the terminal or your **logs** directory for errors.
 
@@ -271,7 +272,9 @@ We would like to thank their creators for their great work and contributions to
"""default (model original), auto (fastest available on device), int8, int8_float16 etc. - see https://opennmt.net/CTranslate2/quantization.html#quantize-on-model-conversion"""
+    compute_type: str
+    """cpu, cuda, auto"""
+    device: str
 
 
 class XVASynthSettings(BaseModel):
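The `compute_type` and `device` fields added in this hunk look like they correspond to the arguments of the same names on faster-whisper's `WhisperModel` constructor. The following is a minimal sketch of how such settings might be checked and forwarded, assuming that mapping; `FasterWhisperSettingsSketch`, `model_size`, and `model_kwargs` are hypothetical names for illustration and are not taken from this PR.

```python
from dataclasses import dataclass

# Device values listed in the docstring of the new `device` field.
ALLOWED_DEVICES = {"cpu", "cuda", "auto"}


@dataclass
class FasterWhisperSettingsSketch:
    """Hypothetical stand-in for the settings model touched by this hunk."""

    model_size: str    # assumption: not visible in the diff
    compute_type: str  # e.g. "default", "auto", "int8", "int8_float16"
    device: str        # "cpu", "cuda" or "auto"


def model_kwargs(settings: FasterWhisperSettingsSketch) -> dict:
    """Return keyword arguments for a model constructor, rejecting unknown devices."""
    if settings.device not in ALLOWED_DEVICES:
        raise ValueError(f"unsupported device: {settings.device!r}")
    # compute_type is passed through unchecked because the docstring lists
    # its values only as examples ("etc.").
    return {"device": settings.device, "compute_type": settings.compute_type}


settings = FasterWhisperSettingsSketch(model_size="base", compute_type="auto", device="cuda")
kwargs = model_kwargs(settings)
# With faster-whisper installed, this could be used roughly as:
#   from faster_whisper import WhisperModel
#   model = WhisperModel(settings.model_size, **kwargs)
```

Validating `device` early gives a clear error before any model loading starts, which may be easier to diagnose than a failure inside the transcription backend.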
@@ -133,9 +140,25 @@ class WhispercppSttConfig(BaseModel):
     temperature: float
 
 
+class FasterWhisperSttConfig(BaseModel):
+    beam_size: int
+    language: Optional[str] = None
+    hotwords: Optional[str] = None
+    best_of: int
+    temperature: float
+    no_speech_threshold: float
+    multilingual: bool
+    language_detection_threshold: float
+
+
 class WhispercppTranscript(BaseModel):
     text: str
+
+
+class FasterWhisperTranscript(BaseModel):
+    text: str
     language: str
+    language_probability: float
 
 
 class AzureInstanceConfig(BaseModel):
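The field names in `FasterWhisperSttConfig` match keyword arguments of faster-whisper's `WhisperModel.transcribe`, and `FasterWhisperTranscript` matches the `language` and `language_probability` values that call reports. The sketch below shows one way the two models might be connected, assuming that correspondence; it uses stdlib dataclasses instead of pydantic, and `SttConfigSketch`, `TranscriptSketch`, `transcribe_kwargs`, and `build_transcript` are hypothetical helpers, not code from this PR.

```python
from dataclasses import asdict, dataclass
from typing import Iterable, Optional


@dataclass
class SttConfigSketch:
    """Mirror of the fields in FasterWhisperSttConfig (illustrative only)."""

    beam_size: int
    best_of: int
    temperature: float
    no_speech_threshold: float
    multilingual: bool
    language_detection_threshold: float
    language: Optional[str] = None
    hotwords: Optional[str] = None


@dataclass
class TranscriptSketch:
    """Mirror of the fields in FasterWhisperTranscript (illustrative only)."""

    text: str
    language: str
    language_probability: float


def transcribe_kwargs(config: SttConfigSketch) -> dict:
    """Drop unset optional fields so the backend's own defaults apply."""
    return {key: value for key, value in asdict(config).items() if value is not None}


def build_transcript(
    segment_texts: Iterable[str], language: str, language_probability: float
) -> TranscriptSketch:
    """Join per-segment text into a single transcript."""
    text = "".join(segment_texts).strip()
    return TranscriptSketch(text, language, language_probability)


config = SttConfigSketch(
    beam_size=1,
    best_of=2,
    temperature=0.0,
    no_speech_threshold=0.7,
    multilingual=True,
    language_detection_threshold=0.5,
)
stt_kwargs = transcribe_kwargs(config)
# With faster-whisper installed and a loaded `model`, roughly:
#   segments, info = model.transcribe("audio.wav", **stt_kwargs)
#   transcript = build_transcript(
#       (s.text for s in segments), info.language, info.language_probability
#   )
transcript = build_transcript([" Gear", " down."], "en", 0.98)
```

Leaving `language` unset in this sketch keeps it out of the keyword arguments entirely, so language detection is left to the backend.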
@@ -387,7 +410,9 @@ class VoiceActivationSettings(BaseModel):
 
     azure: AzureSttConfig
     whispercpp: WhispercppSettings
+    fasterwhisper: FasterWhisperSettings
     whispercpp_config: WhispercppSttConfig
+    fasterwhisper_config: FasterWhisperSttConfig
 
 
 class FeaturesConfig(BaseModel):
@@ -470,6 +495,7 @@ class CommandJoystickConfig(BaseModel):
     guid: Optional[str] = None
     """The joystick GUID to use. Optional."""
 
+
 class CommandActionConfig(BaseModel):
     keyboard: Optional[CommandKeyboardConfig] = None
     """The keyboard configuration for this action. Optional."""
@@ -618,6 +644,7 @@ class NestedConfig(BaseModel):
     azure: AzureConfig
     xvasynth: XVASynthTtsConfig
     whispercpp: WhispercppSttConfig
+    fasterwhisper: FasterWhisperSttConfig
     wingman_pro: WingmanProConfig
     perplexity: PerplexityConfig
     commands: Optional[list[CommandConfig]] = None
@@ -650,6 +677,7 @@ class BasicWingmanConfig(BaseModel):