Forced Alignment-MFA
☆50Jun 13, 2022Updated 4 years ago
Alternatives and similar repositories for Forced-Alignment-MFA
Users that are interested in Forced-Alignment-MFA are comparing it to the libraries listed below.
- Mining effective negative training samples for keyword spotting (PyTorch)☆65May 23, 2020Updated 6 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- ☆29Jun 15, 2022Updated 4 years ago
- Command line utility for forced alignment using Kaldi☆1,831Mar 31, 2026Updated 2 months ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆36Mar 22, 2021Updated 5 years ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 11 months ago
- Collection of pretrained models for the Montreal Forced Aligner☆197Mar 31, 2026Updated 2 months ago
- KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021☆16Mar 13, 2023Updated 3 years ago
- Acoustic echo cancellation (AEC) is a main algorithm in the pipeline of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 7 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆73Dec 23, 2025Updated 5 months ago
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆41Feb 5, 2026Updated 4 months ago
- ☆11May 7, 2022Updated 4 years ago
- Code for EMNLP2021 paper “Transductive Learning for Unsupervised Text Style Transfer”☆12Sep 19, 2021Updated 4 years ago
- ☆12Mar 11, 2025Updated last year
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆37Aug 30, 2025Updated 9 months ago
- Speech-end detection library, based on WebRTC's VAD engine☆26May 10, 2025Updated last year
- ☆20May 23, 2025Updated last year
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 5 years ago
- Scripts to help with using Montreal Forced Aligner, mainly for transforming Hanzi characters to pinyin and extracting pause times from .textg…☆14Feb 9, 2024Updated 2 years ago
- ☆11Mar 13, 2024Updated 2 years ago
- ☆76Apr 26, 2022Updated 4 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Code for the paper: Zhen-Hua Ling, Yang Ai, Yu Gu, and Li-Rong Dai, "Waveform Modeling and Generation Using Hierarchical Recurrent Neu…☆27May 25, 2018Updated 8 years ago
- IndexTTS Fine-tuning notebooks☆139Jun 17, 2025Updated 11 months ago
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆71Apr 27, 2026Updated last month
- ☆17Aug 9, 2023Updated 2 years ago
- ☆11Nov 23, 2021Updated 4 years ago
- ☆13Nov 2, 2020Updated 5 years ago
- Tool for aligning Chinese transcripts with audio using the AWS transcribe service☆16Jun 25, 2022Updated 3 years ago
- Performance-oriented implementation of independent vector analysis for blind source separation.☆26Mar 26, 2020Updated 6 years ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆497Nov 23, 2025Updated 6 months ago
- ☆32Feb 4, 2025Updated last year
- Estimate the fundamental frequency and inharmonicity coefficient of an isolated piano note☆11Jan 1, 2018Updated 8 years ago
- ☆10Jun 6, 2023Updated 3 years ago
- Tool for converting Opus-encoded audio to MP3☆10Jul 17, 2025Updated 10 months ago
- Streaming Text to Speech Web UI☆22May 6, 2024Updated 2 years ago
- ☆61Jun 15, 2025Updated last year
- ☆23Feb 2, 2026Updated 4 months ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Feb 20, 2019Updated 7 years ago