MontrealCorpusTools / Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
☆1,295Updated 2 months ago
Related projects: ⓘ
- g2p: English Grapheme To Phoneme Conversion☆790Updated last year
- Simple text to phones converter for multiple languages☆1,192Updated last month
- A collection of links and notes on forced alignment tools☆863Updated 2 years ago
- Tools for handling speech data in machine learning projects.☆932Updated this week
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,264Updated 3 months ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆546Updated 4 months ago
- List of speech synthesis papers.☆989Updated last year
- ☆894Updated last week
- Python interface for forced audio alignment using HTK and SoX☆331Updated 4 years ago
- Large, modern dataset for speech recognition☆629Updated 6 months ago
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat☆281Updated 10 months ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,541Updated 4 months ago
- Praat in Python, the Pythonic way☆1,051Updated last month
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,108Updated last week
- A Python wrapper for Kaldi☆991Updated last month
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆658Updated 2 years ago
- The Implementation of FastSpeech based on pytorch.☆856Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆1,902Updated last month
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆670Updated this week
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…☆928Updated 3 weeks ago
- In defence of metric learning for speaker recognition☆1,027Updated 5 months ago
- Phonetisaurus G2P☆446Updated 3 months ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆310Updated 9 months ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆900Updated 5 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆532Updated 2 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆464Updated 3 years ago
- 🐸 collection of TTS papers☆614Updated 2 months ago
- A Python wrapper for the high-quality vocoder "World"☆718Updated 10 months ago
- Python interface to the WebRTC Voice Activity Detector☆2,014Updated 2 months ago
- An Open Source Tools for Speaker Recognition☆590Updated last month