Can we split at last sentence #75

SilasK · 2025-03-07T13:58:25Z

No description provided.

SilasK · 2025-03-10T06:11:59Z

As whisper can only run 30s of audio. in whisper streaming whisper is run itteratively

Whisper is run every second (even if in the publication of whisper streaming they say 2s would be optimal).
Words are commited if high confidence or two consecutive runs agree on them.

the same audio buffer is kept and rerun untill chunked. The chunking may work on tow different ways.

segment
after some time (15s by default) the audio buffer is chunked by the second last word if it is commited or the last commited word.
sentence
after some time the audio buffer is chunked by the second last sentente if all is commited.
If after 30s no sentence is found the audio is anyway chunked.

If I understand it correctly whisper running takes the same time if 1 or 30 s. But the quality is usually better if you have a whole sentence. and if one could run start to end of the same sentence in one go.

From this I would change the sentence fragmentation so that:
Adio buffer is chunked at the end of sentence <15s extept the last or maybe even at the last if the last sentence is complete.
At 15s (or the threshold selected) the audio buffer is cut anyway even no sentence is found.

Update README.md: Add Python syntax highlighting to code chunk

split at last sentence

7ed78ad

SilasK added a commit to SilasK/whisper_streaming_web that referenced this pull request Mar 12, 2025

implement change described in QuentinFuxa#75

c8d704c

QuentinFuxa mentioned this pull request Mar 13, 2025

Finish transcribe after stoping #80

Closed

3 tasks

nick134-bit pushed a commit to nick134-bit/whisper_streaming_web that referenced this pull request Mar 14, 2025

Merge pull request QuentinFuxa#75 from gaardhus/patch-1

6b1c2c5

Update README.md: Add Python syntax highlighting to code chunk

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Can we split at last sentence #75

Can we split at last sentence #75

Uh oh!

SilasK commented Mar 7, 2025

Uh oh!

SilasK commented Mar 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Can we split at last sentence #75

Are you sure you want to change the base?

Can we split at last sentence #75

Uh oh!

Conversation

SilasK commented Mar 7, 2025

Uh oh!

SilasK commented Mar 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant