finos / greenkey-asrtoolkit
A collection of useful tools for handling speech recognition data
β30Updated 2 years ago
Alternatives and similar repositories for greenkey-asrtoolkit:
Users that are interested in greenkey-asrtoolkit are comparing it to the libraries listed below
- 24-hour Automatic Speech Recognitionβ27Updated 3 years ago
- πΈTTS recipes for different datasetsβ85Updated 2 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- β74Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output.β57Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β101Updated last year
- A crash course for training speech recognition models using DeepSpeech.β24Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeβ111Updated 2 years ago
- π« check your data, before you wreck your modelβ16Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text toβ¦β43Updated 3 years ago
- BurrMill coreβ21Updated 3 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Courtβ22Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.β15Updated 4 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone β¦β41Updated 2 years ago
- Program to benchmark various speech recognition APIsβ80Updated 5 years ago
- End-to-end spoken language identification out of the box.β48Updated 4 years ago
- Labeled data for homograph disambiguationβ55Updated last year
- A TensorFlow Implementation of Punctuation Restoration.β18Updated 4 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 5 years ago
- An online speech recognition extension toolkit of Kaldiβ56Updated 3 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysoxβ13Updated 7 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.β129Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR modelsβ31Updated 3 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis latticesβ16Updated 11 months ago
- Python library for handling audio datasets.β136Updated last year
- Multistream CNN for Robust Acoustic Modelingβ40Updated 3 years ago
- NMT based punctuation prediction system using lexical and acoustic features .β14Updated 4 years ago
- Speaker diarization python system based on binary key speaker modellingβ61Updated 3 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?β34Updated 6 years ago
- A lightweight library to compute Diarization Error Rate (DER).β59Updated last year