AI4Bharat / IndicF5
☆70 Updated 3 months ago
Alternatives and similar repositories for IndicF5
Users who are interested in IndicF5 compare it to the repositories listed below.
- ☆275 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆66 Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing. ☆257 Updated last year
- Fine Tune the Style-TTS2 Voice Model ☆264 Updated 6 months ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu… ☆50 Updated this week
- ☆294 Updated 5 months ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu… ☆15 Updated last year
- ☆185 Updated last year
- SoTA open-source TTS ☆128 Updated 7 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆46 Updated 3 months ago
- Fine-tune the LLM part of the Spark-TTS model ☆119 Updated 9 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi… ☆230 Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆218 Updated 8 months ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆86 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆160 Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆196 Updated 8 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆70 Updated 7 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆187 Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆127 Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆69 Updated 2 months ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023 ☆55 Updated 2 years ago
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS ☆53 Updated last year
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation ☆198 Updated 5 months ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS" ☆103 Updated 6 months ago
- A simple script to prepare a dataset for training the Tortoise TTS model via https://git.ecker.tech/mrq/ai-voice-cloning ☆12 Updated 2 years ago
- Your one-stop solution for voice dataset creation ☆128 Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech. ☆48 Updated 2 years ago
- Easily create a dataset from a list of YouTube links ☆21 Updated 2 years ago
- [WIP] VoiceSmith makes training text to speech models easy. ☆228 Updated 3 years ago
- ☆72 Updated last year