Forced Alignment-MFA
☆50Jun 13, 2022Updated 3 years ago
Alternatives and similar repositories for Forced-Alignment-MFA
Users that are interested in Forced-Alignment-MFA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mining effective negative training samples for keyword spotting (PyTorch)☆65May 23, 2020Updated 5 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- Command line utility for forced alignment using Kaldi☆1,796Mar 31, 2026Updated 2 weeks ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆35Mar 22, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 9 months ago
- Collection of pretrained models for the Montreal Forced Aligner☆193Mar 31, 2026Updated 2 weeks ago
- KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021☆16Mar 13, 2023Updated 3 years ago
- FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation☆79Mar 20, 2026Updated 3 weeks ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆71Dec 23, 2025Updated 3 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆38Feb 5, 2026Updated 2 months ago
- ☆11May 7, 2022Updated 3 years ago
- Generative Adaptive MIDI Extractor☆146Mar 29, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆12Mar 11, 2025Updated last year
- Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…☆27Mar 29, 2026Updated 3 weeks ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago
- This contains files for the Starcraft Campaign but the Player and AI are in reversed roles. (Some modifications are made too)☆13Jan 17, 2022Updated 4 years ago
- Speech-end detection library, based on WebRTC's VAD engine☆26May 10, 2025Updated 11 months ago
- Repository for Monash Psychology Honours Statistics Unit (PSY4210)☆17Feb 19, 2026Updated 2 months ago
- ☆19May 23, 2025Updated 10 months ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 5 years ago
- Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textg…☆14Feb 9, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆12Mar 13, 2024Updated 2 years ago
- ☆77Apr 26, 2022Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Codes of the paper: * Zhen-Hua Ling , Yang Ai, Yu Gu, and Li-Rong Dai, "Waveform Modeling and Generation Using Hierarchical Recurrent Neu…☆27May 25, 2018Updated 7 years ago
- IndexTTS Fine-tuning notebooks☆137Jun 17, 2025Updated 10 months ago
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆68Jan 27, 2026Updated 2 months ago
- Generate word-word similarities from Gensim's latent semantic indexing (Python)☆11Jan 10, 2017Updated 9 years ago
- ☆16Aug 9, 2023Updated 2 years ago
- ☆11Nov 23, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13Nov 2, 2020Updated 5 years ago
- Performance-oriented implementation of independent vector analysis for blind source separation.☆26Mar 26, 2020Updated 6 years ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆486Nov 23, 2025Updated 4 months ago
- Python package to enable the use of a Python-based API for EGI's NetStation EEG amplifier interface.☆13Nov 1, 2023Updated 2 years ago
- ☆30Feb 4, 2025Updated last year
- ☆10Jun 6, 2023Updated 2 years ago
- opus编码转mp3编码工具☆10Jul 17, 2025Updated 9 months ago