Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Whisper can produce invalid UTF-8 #1

Open
edef1c opened this issue May 19, 2023 · 2 comments
Open

Whisper can produce invalid UTF-8 #1

edef1c opened this issue May 19, 2023 · 2 comments

Comments

@edef1c
Copy link

edef1c commented May 19, 2023

thread '<unnamed>' panicked at 'called `Result::unwrap()` on an `Err` value: InvalidUtf8 { error_len: None, valid_up_to: 328 }', src/filter/imp.rs:139:18
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
ERROR: from element /GstPipeline:pipeline0/GstWhisperFilter:whisperfilter0: Panicked: called `Result::unwrap()` on an `Err` value: InvalidUtf8 { error_len: None, valid_up_to: 328 }
Execution ended after 0:47:34.668096672
Setting pipeline to NULL ...
ERROR: from element /GstPipeline:pipeline0/GstWhisperFilter:whisperfilter0: Panicked
ERROR: from element /GstPipeline:pipeline0/GstAutoAudioSrc:autoaudiosrc0/GstPulseSrc:autoaudiosrc0-actual-src-puls: Internal data stream error.
Additional debug info:
../libs/gst/base/gstbasesrc.c(3132): gst_base_src_loop (): /GstPipeline:pipeline0/GstAutoAudioSrc:autoaudiosrc0/GstPulseSrc:autoaudiosrc0-actual-src-puls:
streaming stopped, reason error (-5)
ERROR: from element /GstPipeline:pipeline0/GstWhisperFilter:whisperfilter0: Panicked
ERROR: from element /GstPipeline:pipeline0/GstQueue:queue0: Internal data stream error.
Additional debug info:
../plugins/elements/gstqueue.c(992): gst_queue_handle_sink_event (): /GstPipeline:pipeline0/GstQueue:queue0:
streaming stopped, reason flushing (-2)
ERROR: from element /GstPipeline:pipeline0/GstWhisperFilter:whisperfilter0: Panicked
Freeing pipeline ...
@jbg
Copy link
Contributor

jbg commented May 19, 2023

@jbg
Copy link
Contributor

jbg commented May 20, 2023

We don't panic when whisper-rs returns an error from full_get_segment_text() now: 32d96df

But if whisper-rs API is improved, we can maybe recover some text from the segment rather than ignoring it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants