leduclinh7141 / BetterWhisperX
☆16Updated 6 months ago
Alternatives and similar repositories for BetterWhisperX
Users that are interested in BetterWhisperX are comparing it to the libraries listed below
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆49Updated last week
- Real time faster whisper gradio☆26Updated 7 months ago
- A lightweight end-to-end text-to-speech model☆114Updated 2 months ago
- ☆375Updated this week
- Added vLLM support to IndexTTS for faster inference.☆87Updated this week
- Generate ARKit expression from audio in realtime☆87Updated last week
- ☆158Updated 5 months ago
- A Low-Latency, Lightweight and High-Performance Streaming VAD☆166Updated this week
- How to get the best results: Improve-Your-Prompt is a prompt for optimizing prompts☆41Updated 5 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆160Updated 3 weeks ago
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆39Updated 3 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆156Updated 3 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆180Updated last week
- ☆41Updated last year
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from FunASR, with onnxruntime☆91Updated 7 months ago
- GPT-4o-level, real-time spoken dialogue system.☆324Updated 3 months ago
- ☆156Updated 6 months ago
- Model Context Protocol (MCP) server implementation with Minimax API integration☆47Updated last month
- A toolkit for speaker diarization.☆188Updated this week
- Video-to-video translation with voice cloning and lip synchronization; supports translation between Chinese and English☆129Updated last year
- ComfyUI wrapper for Moondream's gaze detection☆53Updated 3 months ago
- AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model☆65Updated 2 months ago
- ☆64Updated 8 months ago
- An API project for SenseVoice that outputs timestamped subtitles☆34Updated 6 months ago
- LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)☆194Updated this week
- Streaming ASR and TTS based on FastAPI + sherpa-onnx☆105Updated 3 weeks ago
- Locally deployed offline video/audio transcription with speaker diarization and LLM summarization, modded from FunClip☆34Updated 4 months ago
- ☆304Updated last week
- project page for ChatAnyone☆106Updated last month
- Official implementation of the paper "MusicInfuser: Making Video Diffusion Listen and Dance"☆70Updated last month