AI4Bharat / IndicF5Links
β58Updated this week
Alternatives and similar repositories for IndicF5
Users that are interested in IndicF5 are comparing it to the libraries listed below
Sorting:
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β252Updated last year
- β43Updated 3 months ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quβ¦β42Updated this week
- A TTS model capable of generating ultra-realistic dialogue in one pass.β208Updated 5 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archiβ¦β191Updated 5 months ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023β54Updated 2 years ago
- β169Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β65Updated 10 months ago
- β281Updated 2 months ago
- β274Updated last year
- Fine Tune the Style-TTS2 Voice Modelβ252Updated 3 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restorationβ185Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.β68Updated 2 weeks ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"β101Updated 3 months ago
- Text-to-Speech for languages of Indiaβ275Updated 10 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASRβ62Updated 3 months ago
- Identify the emotion of multiple speakers in an Audio Segmentβ175Updated 2 years ago
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latencyβ59Updated this week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusionβ185Updated last year
- Create training data for training a voice cloner for bark text to speech.β46Updated 2 years ago
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ157Updated last year
- finetune llm part for spark-tts modelβ109Updated 6 months ago
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translationβ181Updated last month
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTSβ49Updated 9 months ago
- Run Retrieval-based Voice Conversion training and inference with ease.β11Updated 8 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.β42Updated last week
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quβ¦β14Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.β85Updated 10 months ago
- Efficient approach to speaker diarization using voice characteristics extractionβ100Updated 3 months ago
- Your one-stop solution for voice dataset creationβ124Updated last year