Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving the quality of synthesis, and building small-footprint TTS integrated with disability aids and various other applications.
☆18Feb 9, 2024Updated 2 years ago
Alternatives and similar repositories for Fastspeech2_MFA
Users that are interested in Fastspeech2_MFA are comparing it to the libraries listed below.
- Text-to-Speech for languages of India☆378Nov 8, 2024Updated last year
- Text to Speech for Indic languages☆53Mar 23, 2022Updated 4 years ago
- Repository for code and dataset for our EMNLP 2021 paper - “So You Think You’re Funny?”: Rating the Humour Quotient in Standup Comedy.☆15Sep 26, 2022Updated 3 years ago
- ☆35Aug 22, 2024Updated last year
- Some scripts to help with using Montreal Forced Aligner, mainly for transforming Hanzi characters to pinyin and extracting pause time from .textg…☆14Feb 9, 2024Updated 2 years ago
- OCR as a service☆17Dec 11, 2016Updated 9 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆23Aug 13, 2024Updated last year
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆57Feb 5, 2026Updated 4 months ago
- ☆24May 5, 2022Updated 4 years ago
- Language Identification for Indian languages☆36Dec 2, 2025Updated 7 months ago
- create dataset from list of youtube links easily☆23Apr 18, 2023Updated 3 years ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 11 months ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- Triangle in practice! Triton☆16Feb 15, 2024Updated 2 years ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 9 months ago
- [ICLR 2025 Spotlight] Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model☆16Apr 23, 2025Updated last year
- Diffusers++: State-of-the-art diffusion models for image and audio generation in PyTorch☆14Sep 18, 2024Updated last year
- An attempt to recognise raga of a Carnatic song.☆11Dec 24, 2022Updated 3 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆14Mar 11, 2025Updated last year
- ModelQ is a lightweight, battle-tested Python library for scheduling and queuing machine learning inference tasks. It's designed as a fas…☆18Jun 23, 2026Updated last week
- Malayalam Corpus by Swathanthra Malayalam Computing☆20Apr 2, 2023Updated 3 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆15Apr 22, 2026Updated 2 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆70Nov 1, 2024Updated last year
- Generate and morph between checkfaces☆22Updated this week
- Vocoder based on PC-DDSP and nsf-HiFiGAN☆19Jul 17, 2023Updated 2 years ago
- image retrieval/tagging with CLIP☆13Jul 13, 2024Updated last year
- VTrick template resource☆18Nov 8, 2022Updated 3 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆67Aug 24, 2025Updated 10 months ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Repository contains various Malayalam ASR based resources curated from multiple sources☆18Oct 1, 2021Updated 4 years ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆98Jul 4, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- Tools to isolate speaker and transcribe unstructured audio clips☆11Dec 4, 2022Updated 3 years ago
- ☆11Feb 20, 2025Updated last year
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆25Dec 22, 2022Updated 3 years ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆29Oct 18, 2024Updated last year
- Elixir bindings to Kokoro-82M text-to-speech model☆20Mar 4, 2025Updated last year
- Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021)☆10Nov 2, 2021Updated 4 years ago
- ☆10Apr 8, 2024Updated 2 years ago