one-among-us/whisper-web

Whisper Web

Web application for accurate speech-to-text conversion, powered by OpenAI Whisper and Pyannote.

whisper.hydev.org

Features

  • Speech-to-text with timestamps
  • Speaker identification
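
The two features can be combined by attaching a speaker label to each timestamped transcript segment. This README does not document how the project does this, so the following is only a minimal sketch of one common approach: give each segment the speaker whose turn overlaps it the most. The `Segment` and `Turn` types, the `assign_speakers` helper, and the sample data are all hypothetical and are not taken from the repository.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Segment:
    """A transcribed span of audio (hypothetical shape, times in seconds)."""
    start: float
    end: float
    text: str


@dataclass
class Turn:
    """A diarized speaker turn (hypothetical shape, times in seconds)."""
    start: float
    end: float
    speaker: str


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two intervals, or 0 if disjoint."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[Segment], turns: List[Turn]
) -> List[Tuple[float, float, Optional[str], str]]:
    """Label each segment with the speaker whose turn overlaps it the most.

    Segments that overlap no turn get a speaker of None.
    """
    labeled = []
    for seg in segments:
        best_speaker, best_overlap = None, 0.0
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.speaker, shared
        labeled.append((seg.start, seg.end, best_speaker, seg.text))
    return labeled


# Made-up sample data, for illustration only.
segments = [
    Segment(0.0, 2.5, "Hello there."),
    Segment(2.5, 5.0, "Hi, how are you?"),
]
turns = [
    Turn(0.0, 2.4, "SPEAKER_00"),
    Turn(2.4, 5.0, "SPEAKER_01"),
]

labeled = assign_speakers(segments, turns)
for start, end, speaker, text in labeled:
    print(f"[{start:6.2f} - {end:6.2f}] {speaker}: {text}")
```

Maximum overlap is simple and tolerant of small boundary mismatches between the two models, but it gives a single label to a segment in which two people speak; splitting at word-level timestamps would be one way to refine it.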

Performance

Currently, the model can process 2 hours of audio in 12 minutes on an RTX 3060 graphics card, which is about 10 times faster than real time.
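
To get a rough idea of how long another recording might take, the figure above can be turned into a speedup ratio. This is a back-of-the-envelope sketch only: it assumes processing time scales linearly with audio length on the same hardware, which the README does not claim, and the helper names are hypothetical.

```python
def speedup_over_real_time(audio_seconds: float, processing_seconds: float) -> float:
    """How many seconds of audio are processed per second of wall-clock time."""
    return audio_seconds / processing_seconds


def estimate_processing_seconds(audio_seconds: float, speedup: float) -> float:
    """Estimated wall-clock time, assuming the speedup stays constant."""
    return audio_seconds / speedup


# Figures quoted in this README: 2 hours of audio in 12 minutes on an RTX 3060.
speedup = speedup_over_real_time(2 * 60 * 60, 12 * 60)

# Hypothetical 45-minute recording on the same hardware.
estimate = estimate_processing_seconds(45 * 60, speedup)

print(f"Speedup: {speedup:.1f}x real time")
print(f"Estimated time for 45 minutes of audio: {estimate / 60:.1f} minutes")
```

Actual times will vary with the model size, the GPU, and the audio content, so the estimate is best read as an order of magnitude.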

Deployment Instructions

TODO