finos / greenkey-asrtoolkit
A collection of useful tools for handling speech recognition data
β30Updated 2 years ago
Alternatives and similar repositories for greenkey-asrtoolkit:
Users that are interested in greenkey-asrtoolkit are comparing it to the libraries listed below
- 24-hour Automatic Speech Recognitionβ27Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output.β57Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β101Updated 2 years ago
- Python library for handling audio datasets.β137Updated last year
- A crash course for training speech recognition models using DeepSpeech.β24Updated 3 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Courtβ22Updated 2 years ago
- BurrMill coreβ21Updated 3 years ago
- Automatic Speech Recognition Dataset Generationβ37Updated 6 years ago
- β75Updated 3 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone β¦β41Updated 2 years ago
- A collection of basic python modules for spoken natural language processingβ56Updated 5 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 3 years ago
- A TensorFlow Implementation of Punctuation Restoration.β18Updated 4 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- End-to-end spoken language identification out of the box.β48Updated 4 years ago
- A lightweight library to compute Diarization Error Rate (DER).β59Updated last year
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.β15Updated 4 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.β129Updated 4 years ago
- Command line tool to create corpora for Common Voiceβ75Updated 10 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text toβ¦β44Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ64Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corporaβ81Updated 10 months ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ154Updated 5 years ago
- Swarah: Indian-English speech dataset collected across the countryβ29Updated last year
- Phonetically-Oriented Word Error Rateβ34Updated 5 years ago
- Labeled data for homograph disambiguationβ57Updated last year
- End to End Dialect Identification using Convolutional Neural Networkβ52Updated 5 years ago
- Speaker diarization python system based on binary key speaker modellingβ61Updated 3 years ago
- Covering grammars for English and Russian text normalizationβ60Updated 5 years ago