☆57Dec 19, 2022Updated 3 years ago
Alternatives and similar repositories for ASR2022
Users that are interested in ASR2022 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆22Jan 18, 2023Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- [ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks☆51Feb 21, 2024Updated 2 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Oct 21, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆10Mar 20, 2021Updated 5 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Feb 4, 2023Updated 3 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- ☆29May 3, 2023Updated 2 years ago
- Фонограми та синтагми: інструменти обробки☆21Jun 21, 2025Updated 9 months ago
- ☆32Jan 6, 2022Updated 4 years ago
- Code designed for analysis of tongue contour data - produces three metrics (Procrustes analysis, Modified Curvature Index and Fourier ana…☆10Apr 19, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated last year
- A JAX library for building lattice-based speech transducer models☆47Mar 2, 2026Updated 3 weeks ago
- ☆20Sep 20, 2024Updated last year
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated 11 months ago
- ☆12Jun 10, 2021Updated 4 years ago
- Trained deep neural-net models for estimating articulatory keypoints from midsagittal ultrasound tongue videos and front-view lip camera …☆24Jun 13, 2023Updated 2 years ago
- Speech in Flax/JAX☆15Jul 11, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 3 weeks ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Sep 30, 2022Updated 3 years ago
- ☆57Apr 18, 2023Updated 2 years ago
- ☆76Mar 18, 2022Updated 4 years ago
- DysfluentWFST☆18Nov 13, 2025Updated 4 months ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆171Mar 12, 2026Updated last week
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆38Feb 23, 2023Updated 3 years ago
- A differentiable version of SPTK☆196Feb 26, 2026Updated last month
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- ☆29Feb 4, 2025Updated last year
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆157Feb 15, 2020Updated 6 years ago