leduclinh7141 / BetterWhisperX
☆17Updated 6 months ago
Alternatives and similar repositories for BetterWhisperX
Users who are interested in BetterWhisperX are comparing it to the libraries listed below
- Real time faster whisper gradio☆26Updated 8 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆62Updated last month
- A lightweight end-to-end text-to-speech model☆115Updated 3 months ago
- Generate ARKit expression from audio in realtime☆102Updated this week
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆158Updated 3 months ago
- ☆160Updated 6 months ago
- An agentic workflow for story book generation☆30Updated 2 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆54Updated 6 months ago
- AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model☆69Updated 2 months ago
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from the open-source FunASR project, with onnxruntime☆94Updated 8 months ago
- Model Context Protocol (MCP) server implementation with Minimax API integration☆48Updated last month
- We Speech Transcript based on LLM, in 300 lines of code.☆162Updated this week
- Video-to-video translation with voice cloning and lip synchronization; supports translation between Chinese and English☆130Updated last year
- An API project for SenseVoice that outputs timestamped subtitles☆34Updated 7 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 2 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 2 months ago
- ☆405Updated 2 weeks ago
- ☆156Updated 7 months ago
- ☆108Updated this week
- Added vLLM support to IndexTTS for faster inference.☆186Updated this week
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…☆38Updated this week
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆76Updated last month
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆231Updated 2 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) or remotely (I/O bound).☆53Updated this week
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆212Updated last month
- A ChatTTS audio repository containing different voices generated with different seeds, so you can easily pick the seed you like.☆50Updated last year
- MaskGCT demo page☆14Updated 3 months ago
- ☆382Updated last month
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated 8 months ago
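- Gao Jun AI Daily (《高军 AI 日报》): spend one minute a day to get curated, cutting-edge AI information. Coverage includes, but is not limited to, the latest AI news, AI tools, AI art, open-source projects, and learning tutorials.☆47Updated 6 months ago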