smtiitm / Fastspeech2_HS
Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrated with disability aids and various other applications.
☆22Updated this week
Alternatives and similar repositories for Fastspeech2_HS:
Users that are interested in Fastspeech2_HS are comparing it to the libraries listed below
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆48Updated 9 months ago
- Text-to-Speech for languages of India☆221Updated 4 months ago
- Finetune VITS and MMS using HuggingFace's tools☆139Updated 11 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆301Updated last year
- ☆279Updated 9 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆224Updated last week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆173Updated 6 months ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆15Updated last year
- Pretraining, fine-tuning and evaluation scripts for Indic-Wav2Vec2☆84Updated last year
- ☆201Updated 10 months ago
- ☆352Updated 6 months ago
- Update ASR paper everyday☆172Updated this week
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆51Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆240Updated 9 months ago
- ☆125Updated 3 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆71Updated 9 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆143Updated last year
- ☆39Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- A python package for whisper normalizer☆53Updated 3 weeks ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆75Updated 3 years ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆156Updated last week
- VoiceBench: Benchmarking LLM-Based Voice Assistants☆159Updated this week
- ☆45Updated 2 years ago
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆105Updated 5 months ago
- ☆254Updated last year
- ☆43Updated 2 years ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆92Updated 9 months ago
- ☆207Updated this week
- AudioBench: A Universal Benchmark for Audio Large Language Models☆171Updated this week