A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.
☆45May 25, 2023Updated 2 years ago
Alternatives and similar repositories for MFARunner
Users that are interested in MFARunner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Bilingual-TTS (Japanese and Korean)☆32Jul 1, 2023Updated 2 years ago
- An unofficial implementation of Vector Quantization Voice Conversion (VQVC).☆29Apr 12, 2021Updated 4 years ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 4 years ago
- Implementation of Korean FastSpeech2☆215Jan 29, 2023Updated 3 years ago
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)☆13Feb 27, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Jan 14, 2021Updated 5 years ago
- 22人で童謡を5曲ずつ歌ってつくった歌唱データベースです。☆14Aug 7, 2022Updated 3 years ago
- MFA acoustic model training based on Opencpop☆15Sep 23, 2022Updated 3 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆57Mar 12, 2024Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- Korean language support for NNSVS/ENUNU☆28Apr 3, 2024Updated last year
- Ultrafast GAN based Vocoder for Text to Speech☆50Jul 16, 2022Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆18Jan 17, 2022Updated 4 years ago
- ☆25Aug 31, 2024Updated last year
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆20May 12, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Singing Voice Synthesis based on VITS, different from VISinger☆196Nov 13, 2023Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Sep 21, 2022Updated 3 years ago
- ☆163Sep 19, 2022Updated 3 years ago
- Various Text-to-speech (TTS) papers based on Deep-learning☆14Feb 26, 2021Updated 5 years ago
- ☆14Aug 1, 2025Updated 7 months ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆71Aug 8, 2022Updated 3 years ago
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆136Feb 18, 2023Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆147Aug 22, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆55Jan 13, 2023Updated 3 years ago
- Collect Voice Conversion researches☆96Mar 17, 2026Updated last week
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Oct 15, 2021Updated 4 years ago
- Tacotron2 for Korean (taKotron2)☆34Apr 8, 2022Updated 3 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- Updated folk of g2pk☆13Aug 18, 2023Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago