talonvoice / wav2letter
Facebook AI Research Automatic Speech Recognition Toolkit
☆23Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for wav2letter
- BurrMill core☆21Updated 3 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆20Updated 5 years ago
- Labeled data for homograph disambiguation☆53Updated last year
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 2 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Updated 7 years ago
- A collection of basic python modules for spoken natural language processing☆56Updated 4 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- RNNs for Text Normalization☆38Updated 6 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated last year
- Self-contained Python package for OpenFst☆50Updated last year
- automatically align transcribed audio and generate a wav2letter training corpus☆35Updated last year
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆14Updated 4 years ago
- Code for "Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection"☆15Updated 2 years ago
- Text normalization scripts from IRISA lab☆12Updated 6 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Updated 9 years ago
- ☆17Updated last year
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- Covering grammars for English and Russian text normalization☆60Updated 5 years ago
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora☆27Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆15Updated last year
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Updated 5 years ago
- ☆22Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year