talonvoice / wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
☆23Updated 3 years ago
Related projects: ⓘ
- BurrMill core☆21Updated 2 years ago
- Labeled data for homograph disambiguation☆53Updated last year
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆21Updated 5 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Updated 9 years ago
- Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation☆15Updated 4 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆34Updated last year
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 2 weeks ago
- A JAX library for building lattice-based speech transducer models☆39Updated 5 months ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆10Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- Coqui Inference Engine☆38Updated 3 years ago
- Tools for working with the CMU Pronunciation Dictionary☆34Updated 7 years ago
- ☆22Updated 2 years ago
- Speech to text library for Rhasspy using Kaldi☆14Updated 9 months ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆14Updated 4 years ago
- Text normalization scripts from IRISA lab☆12Updated 6 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 3 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆41Updated 3 years ago
- Multilingual Grapheme to Phoneme☆47Updated 8 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- Gentle and praatio scripts for easy forced alignment☆18Updated last year
- Code for AccentDB.☆20Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech.☆23Updated 3 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆20Updated 5 years ago
- Code for "Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection"☆15Updated 2 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Updated 6 years ago
- Command line tool to create corpora for Common Voice☆75Updated 3 months ago