DeepSpeech based forced alignment tool
☆239 · Dec 12, 2020 · Updated 5 years ago
Alternatives and similar repositories for DSAlign
Users that are interested in DSAlign are comparing it to the libraries listed below.
- A collection of links and notes on forced alignment tools ☆939 · Apr 18, 2026 · Updated last week
- Segment an audio file and obtain utterance alignments. (Python package) ☆346 · May 15, 2024 · Updated last year
- gentle forced aligner ☆1,691 · May 19, 2025 · Updated 11 months ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 · Apr 11, 2023 · Updated 3 years ago
- The People’s Speech Dataset ☆113 · Jan 11, 2024 · Updated 2 years ago
- Command line utility for forced alignment using Kaldi ☆1,803 · Mar 31, 2026 · Updated last month
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,392 · Jun 6, 2024 · Updated last year
- scripts to align a given wave to its transcription using trained models by Kaldi ☆36 · Aug 15, 2019 · Updated 6 years ago
- Coqui Inference Engine ☆41 · Aug 3, 2021 · Updated 4 years ago
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment) ☆2,828 · Jun 22, 2024 · Updated last year
- Coqui STT (🐸STT) based forced alignment tool ☆13 · Feb 24, 2022 · Updated 4 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) ☆104 · Nov 26, 2022 · Updated 3 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- Python interface for forced audio alignment using HTK and SoX ☆350 · Jun 28, 2020 · Updated 5 years ago
- Simple text to phones converter for multiple languages ☆1,539 · Sep 26, 2024 · Updated last year
- A library for speech data augmentation in time-domain ☆686 · Aug 30, 2021 · Updated 4 years ago
- ☆26 · Apr 21, 2021 · Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion. ☆130 · Mar 31, 2021 · Updated 5 years ago
- speech-aligner is a tool that generates phoneme-level time alignment annotations from human speech and its corresponding text. ☆410 · Apr 8, 2020 · Updated 6 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆74 · Oct 9, 2020 · Updated 5 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit ☆939 · Sep 4, 2024 · Updated last year
- Basic wavenet and fftnet vocoder model. ☆19 · Feb 7, 2022 · Updated 4 years ago
- An extensible speech synthesis system, built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery ☆26 · Jun 24, 2019 · Updated 6 years ago
- A script for audio/transcript alignment. Fork of p2fa. ☆69 · Mar 15, 2018 · Updated 8 years ago
- Charsiu: A neural phonetic aligner. ☆341 · Sep 19, 2022 · Updated 3 years ago
- ☆17 · Apr 14, 2023 · Updated 3 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch. ☆409 · Jul 7, 2021 · Updated 4 years ago
- A repository for benchmarking neural vocoders by their quality and speed. ☆211 · May 30, 2025 · Updated 11 months ago
- g2p: English Grapheme To Phoneme Conversion ☆918 · Jan 5, 2023 · Updated 3 years ago
- A fast, high-quality neural vocoder. ☆298 · Jul 18, 2023 · Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech. ☆14 · Aug 6, 2019 · Updated 6 years ago
- PyTorch implementation of "EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture" ☆116 · Dec 22, 2021 · Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆82 · May 3, 2024 · Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆36 · Mar 31, 2023 · Updated 3 years ago
- Neural network-based forced alignment with bidirectional attention mechanism ☆78 · Jan 17, 2025 · Updated last year
- Server framework for Kaldi ASR Toolkit ☆99 · Sep 17, 2023 · Updated 2 years ago
- RawNet: Fast End-to-End Neural Vocoder ☆42 · May 29, 2019 · Updated 6 years ago
- Python library for handling audio datasets. ☆138 · Jul 6, 2023 · Updated 2 years ago