Nik-Kras / Live_ASR_Whisper_Gradio
Real Time Speech To Text with corrections powered by Gradio
☆10Updated 2 months ago
Alternatives and similar repositories for Live_ASR_Whisper_Gradio:
Users who are interested in Live_ASR_Whisper_Gradio are comparing it to the libraries listed below:
- A lightweight end-to-end text-to-speech model☆111Updated last month
- Speech Diarization for scrum automation☆102Updated last year
- Open source inference code for Rev's model☆389Updated 3 weeks ago
- FastAPI service on top of WhisperX☆78Updated this week
- Running the F5-TTS by ONNX Runtime☆135Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- ☆83Updated 9 months ago
- Live-Transcription (STT) with Whisper PoC☆175Updated 9 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆117Updated last year
- A toolkit for speaker diarization.☆174Updated last week
- a gradio webui for faster whisper☆258Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆93Updated 11 months ago
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from FunASR, with onnxruntime☆86Updated 6 months ago
- web based editor for subtitles and transcripts☆128Updated 7 months ago
- OpenAI API and Whisper based Video Translation☆73Updated 3 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆258Updated 4 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆156Updated last month
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆102Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- Have a natural voice conversation with an LLM☆246Updated 3 months ago
- Tell a story and get a live feed of images.☆136Updated last year
- ☆173Updated last year
- A voice chat application modified from the OpenAI Realtime Console. Supports defining a custom API base URL.☆31Updated 5 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆173Updated 6 months ago
- ☆42Updated last year
- Accelerating faster-whisper single file processing by multiprocessing through parallelization☆53Updated last year
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆154Updated last year
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project☆52Updated last year
- Free Search is a wrapper on top of publicly available SearXNG instances to give free internet access as a rest API.☆147Updated this week