You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+17-5
Original file line number
Diff line number
Diff line change
@@ -43,13 +43,13 @@ This library uses:
43
43
pip install RealtimeSTT
44
44
```
45
45
46
-
This will install all the necessary dependencies, including a CPU support only version of PyTorch.
46
+
This will install all the necessary dependencies, including a **CPU support only** version of PyTorch.
47
47
48
48
### GPU Support with CUDA (recommended)
49
49
50
-
Additional steps are needed for a GPU-optimized installation. These steps are recommended for those who require better performance and have a compatible NVIDIA GPU.
50
+
Additional steps are needed for a **GPU-optimized** installation. These steps are recommended for those who require **better performance** and have a compatible NVIDIA GPU.
51
51
52
-
If you plan to use RealtimeSTT with GPU support via CUDA, please follow these steps:
52
+
To use RealtimeSTT with GPU support via CUDA please follow these steps:
53
53
54
54
1.**Install NVIDIA CUDA Toolkit 11.8**:
55
55
- Visit [NVIDIA CUDA Toolkit Archive](https://developer.nvidia.com/cuda-11-8-0-download-archive).
@@ -106,7 +106,7 @@ print(recorder.text())
106
106
107
107
### Callbacks
108
108
109
-
You can set callback functions to be executed when recording starts or stops:
109
+
You can set callback functions to be executed on different events (see [Configuration](#configuration)) :
110
110
111
111
```python
112
112
def my_start_callback():
@@ -151,7 +151,9 @@ The test subdirectory contains a set of scripts to help you evaluate and underst
151
151
152
152
### Initialization Parameters for `AudioToTextRecorder`
153
153
154
-
When you initialize the `AudioToTextRecorder` class, you have various options to customize its behavior. Here are the available parameters:
154
+
When you initialize the `AudioToTextRecorder` class, you have various options to customize its behavior.
155
+
156
+
#### General Parameters
155
157
156
158
- **model** (str, default="tiny"): Model size or path for transcription.
- **silero_sensitivity** (float, default=0.6): Sensitivity for Silero's voice activity detection ranging from 0 (least sensitive) to 1 (most sensitive). Default is 0.6.
171
177
172
178
- **webrtc_sensitivity** (int, default=3): Sensitivity for the WebRTC Voice Activity Detection engine ranging from 1 (least sensitive) to 3 (most sensitive). Default is 3.
@@ -179,6 +185,8 @@ When you initialize the `AudioToTextRecorder` class, you have various options to
179
185
180
186
- **pre_recording_buffer_duration** (float, default=0.2): The time span, in seconds, during which audio is buffered prior to formal recording. This helps counterbalancing the latency inherent in speech activity detection, ensuring no initial audio is missed.
181
187
188
+
#### Wake Word Parameters
189
+
182
190
- **wake_words** (str, default=""): Wake words for initiating the recording. Multiple wake words can be provided as a comma-separated string. Supported wake words are: alexa, americano, blueberry, bumblebee, computer, grapefruits, grasshopper, hey google, hey siri, jarvis, ok google, picovoice, porcupine, terminator
183
191
184
192
- **wake_words_sensitivity** (float, default=0.6): Sensitivity level for wake word detection (0 for least sensitive, 1 for most sensitive).
@@ -191,6 +199,10 @@ When you initialize the `AudioToTextRecorder` class, you have various options to
191
199
192
200
- **on_wakeword_timeout**: Callback function to be called when the system goes back to an inactive state after when no speech was detected after wake word activation.
193
201
202
+
- **on_wakeword_detection_start**: Callback function to be called the system starts to listen for wake words
203
+
204
+
- **on_wakeword_detection_end**: Callback function to be called when stopping to listen for wake words (e.g. because of timeout or wake word detected)
0 commit comments