Softcatala / open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into different languages.
☆208 Updated 2 months ago
Alternatives and similar repositories for open-dubbing:
Users who are interested in open-dubbing compare it to the libraries listed below.
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆109 Updated 2 months ago
- web based editor for subtitles and transcripts ☆130 Updated 8 months ago
- ez audio transcription tool with flexible processing and post-processing options ☆149 Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆211 Updated 3 weeks ago
- G2P ☆227 Updated this week
- ☆96 Updated last year
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i… ☆167 Updated 3 weeks ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆686 Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆159 Updated 9 months ago
- A UI for the Piper TTS ☆89 Updated 8 months ago
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ… ☆392 Updated this week
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆54 Updated 8 months ago
- ☆64 Updated last month
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs. ☆324 Updated 2 weeks ago
- Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation) ☆261 Updated 2 weeks ago
- FastAPI service on top of WhisperX ☆92 Updated this week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆34 Updated 6 months ago
- Slightly improved official version for finetune xtts ☆336 Updated last month
- ☆223 Updated last month
- The subtitles and translations are generated in real-time and displayed as pop-ups. ☆156 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 11 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ☆59 Updated 6 months ago
- Open source inference code for Rev's model ☆402 Updated last week
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ☆350 Updated 10 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆46 Updated 4 months ago
- Live-Transcription (STT) with Whisper PoC ☆181 Updated 10 months ago
- API server for Instant voice cloning by MyShell. ☆91 Updated 7 months ago
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li… ☆37 Updated 8 months ago
- Input a YouTube video link or upload a video file and get a video with subtitles. ☆119 Updated 8 months ago
- 🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free! ☆201 Updated 5 months ago