Softcatala / open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into different languages.
☆55Updated this week
Related projects ⓘ
Alternatives and complementary repositories for open-dubbing
- ez audio transcription tool with flexible processing and post-processing options☆129Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆83Updated 6 months ago
- web based editor for subtitles and transcripts☆111Updated 2 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆67Updated 6 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆43Updated 3 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆53Updated 10 months ago
- ☆77Updated 2 weeks ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated 5 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆47Updated 3 weeks ago
- 🎥➡️📝 Hermes: Blazing-fast video transcription powered by AI gods! Transcribe 6.5 minutes of video in just 1 second using Groq's LPU. Ch…☆64Updated 2 months ago
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆108Updated 4 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆32Updated this week
- Integrates AI tools into Microsoft® Word® (independently developed, not affiliated with Microsoft)☆31Updated 3 weeks ago
- Text-to-speech API endpoint compatible with OpenAI's TTS API endpoint, using Microsoft Edge TTS to generate speech for free locally☆114Updated last week
- Automated LLM novelist☆35Updated 7 months ago
- Watch and hear endless conversations between two ollamas, hence the Two-Way Conversation Engine (TWICE)☆17Updated last year
- Simulates talk with an AI that can express emotions☆29Updated 3 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆39Updated 2 weeks ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆87Updated 4 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆43Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆44Updated this week
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration