smtiitm/Fastspeech2_MFA

smtiitm / Fastspeech2_MFA

Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving the quality of synthesis, and building small-footprint TTS integrated with disability aids and various other applications.

☆18

Alternatives and similar repositories for Fastspeech2_MFA

Users that are interested in Fastspeech2_MFA are comparing it to the libraries listed below.

AI4Bharat / Indic-TTS
Text-to-Speech for languages of India
☆378 · Nov 8, 2024 · Updated last year
BakerBunker / SALT
[ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation
☆23 · Aug 13, 2024 · Updated last year
yuboona / some-script-to-help-using-Montreal-Forced-Aligner
Scripts to help with using Montreal Forced Aligner, mainly for transforming Hanzi characters to pinyin and extracting pause times from .textg…
☆14 · Feb 9, 2024 · Updated 2 years ago
amritkromana / disfluency_detection_from_audio
☆35 · Aug 22, 2024 · Updated last year
MTG / carnatic-separation-ismir23
Carnatic singing voice separation trained with in-domain data with leakage
☆11 · Nov 5, 2023 · Updated 2 years ago
manmay-nakhashi / TTS_dataset_creator
Easily create a dataset from a list of YouTube links
☆23 · Apr 18, 2023 · Updated 3 years ago
Tencent / SongBench
☆51 · Apr 30, 2026 · Updated 2 months ago
lifeiteng / NotebookTTS
Text-To-Speech for NotebookLM
☆39 · Jul 20, 2025 · Updated last year
parakalan / RagaRecognition
An attempt to recognise raga of a Carnatic song.
☆12 · Dec 24, 2022 · Updated 3 years ago
Speech-Lab-IITM / data2vec-aqc
Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…
☆13 · Mar 18, 2024 · Updated 2 years ago
daemyung / practice-triton
Triangle in practice! Triton
☆16 · Feb 15, 2024 · Updated 2 years ago
indic-ocr / ocrservice
OCR as a service
☆17 · Dec 11, 2016 · Updated 9 years ago
kelechi-c / ripple_net
Image retrieval/tagging with CLIP
☆13 · Jul 13, 2024 · Updated 2 years ago
smtiitm / Fastspeech2_HS
Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…
☆57 · Feb 5, 2026 · Updated 5 months ago
samsad35 / code-ancogen
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14 · Mar 11, 2025 · Updated last year
smc / corpus
Malayalam Corpus by Swathanthra Malayalam Computing
☆21 · Apr 2, 2023 · Updated 3 years ago
AI4Bharat / IndicVoices
☆19 · Feb 22, 2026 · Updated 5 months ago
ganamod / vtrick
VTrick template resource
☆18 · Nov 8, 2022 · Updated 3 years ago
check-face / facemorph.me
Generate and morph between checkfaces
☆22 · Jun 27, 2026 · Updated 3 weeks ago
KD-TAO / MGFR
[ICLR 2025 Spotlight] Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model
☆16 · Apr 23, 2025 · Updated last year
revsic / torch-retriever-vc
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12 · Jan 27, 2023 · Updated 3 years ago
youngsheen / GPST
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆70 · Nov 1, 2024 · Updated last year
stefantaubert / mel-cepstral-distance
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …
☆67 · Aug 24, 2025 · Updated 10 months ago
aswinpradeep / malayalam-asr-datasets
Repository containing various Malayalam ASR resources curated from multiple sources
☆18 · Oct 1, 2021 · Updated 4 years ago
webaverse / LJSpeechTools
Tools to isolate speaker and transcribe unstructured audio clips
☆11 · Dec 4, 2022 · Updated 3 years ago
cogilab / Face
Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021)
☆10 · Nov 2, 2021 · Updated 4 years ago
leloykun / mmsg
Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.
☆29 · Oct 18, 2024 · Updated last year
Harry-Yu-Shuhang / Step-Audio-tts
☆11 · Feb 20, 2025 · Updated last year
cschaefer26 / StyleMelGAN
☆10 · Apr 8, 2024 · Updated 2 years ago
iamanigeeit / present
☆14 · Aug 19, 2024 · Updated last year
IVY-LVLM / Counterfactual-Inception
Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod…
☆20 · Sep 26, 2024 · Updated last year
BakerBunker / FreeV
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
☆98 · Jul 4, 2024 · Updated 2 years ago
sjchoi86 / intro-to-linear-algebra
☆15 · Apr 4, 2023 · Updated 3 years ago
rgzn-aiyun / melgan-cpu
Real-time MelGAN on CPU!
☆13 · Dec 3, 2019 · Updated 6 years ago
SWE-bench / reading-list
Academic papers and works related to SWE-bench and SWE-agents
☆15 · Dec 8, 2025 · Updated 7 months ago
KanikeSaiPrakash / Speech-Emotion-Recognition
Speech Emotion Recognition using Deep Learning
☆13 · May 24, 2021 · Updated 5 years ago
yunyikristy / skipNet
☆12 · Oct 21, 2019 · Updated 6 years ago
shkim816 / acnn_speaker_recog
ACNN for text-independent speaker recognition
☆10 · Feb 8, 2022 · Updated 4 years ago
Yoshifumi-Nakano / visual-text-to-speech
Visual-text to speech
☆14 · Apr 3, 2022 · Updated 4 years ago