Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving the quality of synthesis, and building small-footprint TTS integrated with disability aids and various other applications.
☆17Feb 9, 2024Updated 2 years ago
Alternatives and similar repositories for Fastspeech2_MFA
Users that are interested in Fastspeech2_MFA are comparing it to the libraries listed below.
- Text-to-Speech for languages of India☆361Nov 8, 2024Updated last year
- Text to Speech for Indic languages☆52Mar 23, 2022Updated 4 years ago
- Repository for code and dataset for our EMNLP 2021 paper - “So You Think You’re Funny?”: Rating the Humour Quotient in Standup Comedy.☆15Sep 26, 2022Updated 3 years ago
- Generation of handwritten cyrillic text using fonts☆13Mar 27, 2023Updated 3 years ago
- ☆33Aug 22, 2024Updated last year
- OCR as a service☆17Dec 11, 2016Updated 9 years ago
- Scripts to help with using Montreal Forced Aligner, mainly for converting Hanzi characters to pinyin and extracting pause times from .textg…☆14Feb 9, 2024Updated 2 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆22Aug 13, 2024Updated last year
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆55Feb 5, 2026Updated 2 months ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]☆16Apr 7, 2024Updated 2 years ago
- Carnatic singing voice separation trained with in-domain data with leakage☆11Nov 5, 2023Updated 2 years ago
- ☆24May 5, 2022Updated 3 years ago
- Language Identification for Indian languages☆34Dec 2, 2025Updated 5 months ago
- create dataset from list of youtube links easily☆22Apr 18, 2023Updated 3 years ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 9 months ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- A summary of my work done during the Google Summer of Code(GSoC)'21 at the organization React Native Elements.☆14Aug 19, 2021Updated 4 years ago
- ☆19Mar 15, 2023Updated 3 years ago
- PageView wrapper that supports Hero-like animations☆23May 27, 2022Updated 3 years ago
- Triangles in practice! Triton☆16Feb 15, 2024Updated 2 years ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 7 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆13Apr 22, 2026Updated last week
- [ICLR 2025 Spotlight] Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model☆16Apr 23, 2025Updated last year
- Diffusers++: State-of-the-art diffusion models for image and audio generation in PyTorch☆14Sep 18, 2024Updated last year
- ☆15Dec 12, 2022Updated 3 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆13Mar 11, 2025Updated last year
- ModelQ is a lightweight, battle-tested Python library for scheduling and queuing machine learning inference tasks. It's designed as a fas…☆18Apr 26, 2026Updated last week
- Malayalam Corpus by Swathanthra Malayalam Computing☆20Apr 2, 2023Updated 3 years ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆70Nov 1, 2024Updated last year
- Generate and morph between checkfaces☆22Apr 19, 2026Updated 2 weeks ago
- Vocoder based on PC-DDSP and nsf-HiFiGAN☆18Jul 17, 2023Updated 2 years ago
- image retrieval/tagging with CLIP☆13Jul 13, 2024Updated last year
- VTrick template resource☆19Nov 8, 2022Updated 3 years ago
- Lightning-YOLOs provides clean, modular YOLO object detection models built on PyTorch Lightning, making it easier to train, extend, and e…☆34Jan 19, 2026Updated 3 months ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆66Aug 24, 2025Updated 8 months ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆14Aug 19, 2024Updated last year
- Repository contains various Malayalam ASR based resources curated from multiple sources☆18Oct 1, 2021Updated 4 years ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆97Jul 4, 2024Updated last year