finos / greenkey-asrtoolkit
A collection of useful tools for handling speech recognition data
☆30Updated last year
Related projects ⓘ
Alternatives and complementary repositories for greenkey-asrtoolkit
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated last year
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆14Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Support tools for punctuation and boundary detection for ASR output.☆57Updated last year
- Speaker diarization python system based on binary key speaker modelling☆61Updated 2 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- BurrMill core☆21Updated 3 years ago
- ☆32Updated 2 months ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated 8 months ago
- End-to-end spoken language identification out of the box.☆48Updated 3 years ago
- Text-to-Speech tutorial at SLTU 2016☆35Updated 8 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆62Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆15Updated 3 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆33Updated 6 years ago
- A collection of basic python modules for spoken natural language processing☆56Updated 4 years ago
- Labeled data for homograph disambiguation☆53Updated last year
- ☆74Updated 3 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 6 months ago
- automatically align transcribed audio and generate a wav2letter training corpus☆35Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Updated 4 years ago
- Python library for handling audio datasets.☆131Updated last year