feat: add NVIDIA Parakeet model support via sherpa-onnx by gabrielste1n · Pull Request #146 · OpenWhispr/openwhispr

gabrielste1n · 2026-01-24T00:50:03Z

Summary

Add NVIDIA Parakeet speech-to-text support via sherpa-onnx as an alternative to whisper.cpp, along with several bug fixes for hotkey initialization and model management.

Changes

NVIDIA Parakeet Integration

Introduced NVIDIA Parakeet ASR model support using the sherpa-onnx runtime for cross-platform ONNX inference
Added parakeet.js for model management — downloading, verifying, and resolving bundled sherpa-onnx binaries per platform/arch
Added parakeetServer.js as a sherpa-onnx CLI wrapper that spawns the transcription process
Added parakeetWsServer.js implementing the sherpa-onnx WebSocket protocol for streaming audio to the server
Added ffmpegUtils.js for audio format conversion (WebM/Opus → 16kHz mono PCM WAV) required by sherpa-onnx
Added serverUtils.js with port-finding and server lifecycle utilities
Added scripts/download-sherpa-onnx.js build script to fetch platform-specific sherpa-onnx binaries
Added modelDirUtils.js for shared model directory path resolution
Registered Parakeet model (parakeet-tdt-0.6b-v3, multilingual, ~680MB) in the centralized model registry

UI & Settings

Updated TranscriptionModelPicker.tsx to support selecting between Whisper and Parakeet models with separate download/status tracking
Separated Parakeet and Whisper model settings (parakeetModel vs whisperModel) so each engine retains its own selection
Added new IPC channels for Parakeet operations: model listing, downloading, deletion, status checks, and server management
Extended useSettings.ts and useModelDownload.ts hooks for Parakeet model state
Added comprehensive TypeScript types in electron.ts for all new IPC APIs

Hotkey Fixes

Fixed hotkey initialization race condition where the saved hotkey was never restored on startup because a did-finish-load listener was registered after loadMainWindow() already resolved
Fixed async persistence and JS injection escaping issues in hotkey manager

Model Management Fixes

Fixed whisper.cpp large/turbo model lookup failures by using the registry fileName field for model paths instead of deriving filenames from model names
Centralized model definitions in modelRegistryData.json as single source of truth, removing hardcoded model lists from multiple files

Audio Pipeline

Updated audioManager.js to route transcription to either Whisper or Parakeet based on the selected engine
Added FFmpeg-based audio conversion pipeline ensuring correct sample rate and format for sherpa-onnx input

Implement cross-platform support for NVIDIA's Parakeet TDT ASR models using sherpa-onnx runtime. Parakeet provides 50x faster transcription than Whisper with comparable accuracy. New features: - Add parakeet-tdt-0.6b-v2 (English) and v3 (25 languages) models - Create ParakeetServerManager for CLI-based transcription - Add sherpa-onnx binary download script for all platforms - Enable NVIDIA Parakeet tab in TranscriptionModelPicker UI - Support model download, delete, and selection Architecture follows existing patterns: - parakeet.js mirrors whisper.js for model management - parakeetServer.js uses sherpa-onnx-offline CLI - IPC handlers follow whisper handler patterns - useModelDownload hook extended for 'parakeet' type

…ription routing

…urbo model lookup failures

…t-cross-platform-l5JKr

…sing

…injection escaping

…d listener that never fired after awaited loadMainWindow()

…cket protocol

Resolve conflicts in CHANGELOG.md, CLAUDE.md, electron-builder.json, and package.json by combining Parakeet/sherpa-onnx additions with Windows push-to-talk, custom dictionary, and shared download utilities from main.

claude and others added 13 commits January 24, 2026 00:07

refactor: improve code quality and centralize Parakeet model definitions

0170969

Merge main into feature branch, resolve conflicts

8be7fac

fix: separate Parakeet and Whisper model settings to fix local transc…

1c836f8

…ription routing

fix: use registry fileName for whisper model paths to resolve large/t…

bc0df4a

…urbo model lookup failures

Merge remote-tracking branch 'origin/main' into claude/nvidia-parakee…

6ab7879

…t-cross-platform-l5JKr

chore: docs

462530f

fix: Parakeet transcription with FFmpeg audio conversion and JSON par…

21b5f0d

…sing

fix: hotkey initialization race condition, async persistence, and JS …

6706566

…injection escaping

fix: restore saved hotkey on startup by removing stale did-finish-loa…

8828dcf

…d listener that never fired after awaited loadMainWindow()

refactor: parakeet websocket server

3cde531

fix: parakeet transcription by implementing correct sherpa-onnx WebSo…

8e1ace1

…cket protocol

Merge origin/main into claude/nvidia-parakeet-cross-platform-l5JKr

06a678e

Resolve conflicts in CHANGELOG.md, CLAUDE.md, electron-builder.json, and package.json by combining Parakeet/sherpa-onnx additions with Windows push-to-talk, custom dictionary, and shared download utilities from main.

gabrielste1n closed this Jan 27, 2026

gabrielste1n deleted the claude/nvidia-parakeet-cross-platform-l5JKr branch January 27, 2026 05:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add NVIDIA Parakeet model support via sherpa-onnx#146

feat: add NVIDIA Parakeet model support via sherpa-onnx#146
gabrielste1n wants to merge 13 commits intomainfrom
claude/nvidia-parakeet-cross-platform-l5JKr

gabrielste1n commented Jan 24, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

gabrielste1n commented Jan 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

NVIDIA Parakeet Integration

UI & Settings

Hotkey Fixes

Model Management Fixes

Audio Pipeline

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gabrielste1n commented Jan 24, 2026 •

edited

Loading