☆56Dec 19, 2022Updated 3 years ago
Alternatives and similar repositories for ASR2022
Users that are interested in ASR2022 are comparing it to the libraries listed below
Sorting:
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆22Jan 18, 2023Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 10 months ago
- Speech in Flax/JAX☆15Jul 11, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- A JAX library for building lattice-based speech transducer models☆46Feb 27, 2026Updated last week
- ☆17Apr 14, 2023Updated 2 years ago
- ☆32Jan 6, 2022Updated 4 years ago
- ☆23Jan 21, 2022Updated 4 years ago
- ☆57Apr 18, 2023Updated 2 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago
- ☆10Mar 20, 2021Updated 4 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆150Aug 25, 2023Updated 2 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆170Jan 7, 2026Updated last month
- [ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks☆50Feb 21, 2024Updated 2 years ago
- ☆19Nov 4, 2022Updated 3 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Oct 21, 2025Updated 4 months ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated last year
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Feb 4, 2023Updated 3 years ago
- Фонограми та синтагми: інструменти обробки☆21Jun 21, 2025Updated 8 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- DysfluentWFST☆18Nov 13, 2025Updated 3 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated this week
- Trained deep neural-net models for estimating articulatory keypoints from midsagittal ultrasound tongue videos and front-view lip camera …☆24Jun 13, 2023Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆157Feb 15, 2020Updated 6 years ago
- Grapheme to phoneme model for PyTorch☆43Jul 21, 2022Updated 3 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago