finos / greenkey-asrtoolkitLinks
A collection of useful tools for handling speech recognition data
β30Updated 3 years ago
Alternatives and similar repositories for greenkey-asrtoolkit
Users that are interested in greenkey-asrtoolkit are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.β130Updated 4 years ago
- β76Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β106Updated 2 years ago
- A module for normalising text.β172Updated 4 years ago
- Speaker diarization python system based on binary key speaker modellingβ60Updated 4 years ago
- A crash course for training speech recognition models using DeepSpeech.β24Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output.β55Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeβ111Updated 3 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- Simple text to phonemes converter for multiple languagesβ20Updated 3 years ago
- Forced Alignments for Common Voiceβ32Updated 5 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ156Updated 5 years ago
- Text and Punctuation correction with Deep Learningβ128Updated 5 years ago
- Python library for handling audio datasets.β138Updated 2 years ago
- DeepSpeech based forced alignment toolβ239Updated 5 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2β115Updated 6 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 4 years ago
- Command line tool to create corpora for Common Voiceβ78Updated last month
- Labeled data for homograph disambiguationβ63Updated 2 years ago
- πAn easy-to-use package to restore punctuation of the text.β119Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER).β62Updated 2 weeks ago
- A collection of basic python modules for spoken natural language processingβ55Updated 6 years ago
- π¦ Nala is an agile open-source voice assistant framework (20+ actions).β36Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ71Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ65Updated 5 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text servicesβ58Updated last year
- A tool for automatic phoneme transcriptionβ159Updated 2 years ago
- Tool for creation, manipulation and maintenance of voice corporaβ82Updated last year
- Advanced data structures for handling temporal segments with attached labels.β124Updated 4 months ago