Nik-Kras / Live_ASR_Whisper_Gradio
Real Time Speech To Text with corrections powered by Gradio
☆10Updated 3 months ago
Alternatives and similar repositories for Live_ASR_Whisper_Gradio:
Users that are interested in Live_ASR_Whisper_Gradio are comparing it to the libraries listed below
- A lightweight end-to-end text-to-speech model☆112Updated 2 months ago
- Running the F5-TTS by ONNX Runtime☆146Updated 3 weeks ago
- Speech Diarization for scrum automation☆103Updated last year
- Running the F5-TTS by ONNX Runtime standalone with GUI☆18Updated 4 months ago
- OpenAI API and Whisper based Video Translation☆73Updated 4 months ago
- Live-Transcription (STT) with Whisper PoC☆181Updated 10 months ago
- FastAPI service on top of WhisperX☆85Updated this week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆90Updated 7 months ago
- Open source inference code for Rev's model☆399Updated this week
- a gradio webui for faster whisper☆259Updated last year
- A toolkit for speaker diarization.☆184Updated last month
- We Speech Transcript based on LLM, in 300 lines of code.☆159Updated last week
- ☆83Updated 9 months ago
- web based editor for subtitles and transcripts☆130Updated 8 months ago
- ubuntu 系统下 GLM-4-Voice 部署经验分享☆19Updated 5 months ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆25Updated 5 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆11Updated 9 months ago
- ChatTTS-OpenAI-API is a project built upon the ChatTTS project, implementing the v1/audio/speech endpoint in compliance with OpenAI proto…☆21Updated 10 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆91Updated last year
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆18Updated 2 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆680Updated 4 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆43Updated 8 months ago
- 用于SenseVoice的api项目,输出带时间戳字幕☆34Updated 5 months ago
- F5-TTS 推理加速,速度提升约4倍!☆78Updated 3 months ago
- ☆32Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆158Updated 9 months ago
- ☆67Updated last year
- Archived 🚧|🌻Building ChatBot with LLMs.🌻 | Using async requests. | 具有多 LLM 适应性 | 通用大语言模型代理端框架 |多人称全类型注解☆40Updated last year