leduclinh7141 / BetterWhisperX
☆16Updated 4 months ago
Alternatives and similar repositories for BetterWhisperX:
Users that are interested in BetterWhisperX are comparing it to the libraries listed below
- A lightweight end-to-end text-to-speech model☆111Updated last month
- ☆159Updated 4 months ago
- Real time faster whisper gradio☆26Updated 5 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆156Updated last month
- ☆42Updated last year
- 如何得到最好的结果,Improve-Your-Prompt是一个用于优化prompt的prompt☆41Updated 4 months ago
- ☆157Updated 5 months ago
- 这是一个 ChatTTS 音频仓库,包含用不同 seed 生成的不 同音色,你可以方便地挑选你喜欢的 seed。☆48Updated 9 months ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆141Updated this week
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆120Updated last year
- ☆11Updated 7 months ago
- 鬼畜视频配音字幕同步项目,基于字幕文件srt同步接入TTS,支持GPT-Sovits ChatTTS BertVits2☆40Updated 9 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆86Updated 6 months ago
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆67Updated this week
- AMT-APC: AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model☆62Updated 2 weeks ago
- Googles NotebookLM but local☆167Updated last week
- A toolkit for speaker diarization.☆174Updated last week
- openai realtime webrtc python client☆36Updated 3 months ago
- ComfyUI wrapper for Moondream's gaze detection☆51Updated 2 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆144Updated last month
- A NextJS based app that takes a user prompt, or a YouTube url, or a Website URL, and generates a beautiful Mindmap.☆103Updated 3 weeks ago
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆95Updated last year
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local(CPU/GPU bound) and remote(I/O bound) to run.☆33Updated this week
- Common tasks in a single model☆33Updated last year
- openai realtime webrtc demo☆21Updated 2 months ago
- Official implementation of the paper "MusicInfuser: Making Video Diffusion Listen and Dance"☆60Updated last week
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆89Updated 6 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆205Updated last week
- Comfyui custom node for FunAudioLLM include CosyVoice and SenseVoice☆76Updated 4 months ago
- ChatTTS HTTP API☆52Updated 9 months ago