edemattos/asr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/edemattos/asr)

edemattos / asr

Automatic Speech Recognition at the University of Edinburgh.

☆16

Alternatives and similar repositories for asr

Users that are interested in asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

csukuangfj / kaldi_native_io
View on GitHub
python wrapper for kaldi's native I/O
☆27Jan 9, 2025Updated last year
csukuangfj / kaldi-hmm-gmm
View on GitHub
☆28Apr 24, 2026Updated 3 months ago
wentaozhu / speechnas
View on GitHub
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
☆30Mar 24, 2023Updated 3 years ago
k2-fsa / kaldifst
View on GitHub
Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files
☆56Apr 9, 2026Updated 3 months ago
nmfisher / sherpa_onnx_dart
View on GitHub
Dart plugin wrapping the Sherpa-ONNX runtime. Contains example for speech recognition with Flutter
☆22Jan 3, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xinjli / asr2k
View on GitHub
asr2k
☆51Jun 2, 2024Updated 2 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
MiuLab / Lattice-Transformer-SLU
View on GitHub
Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"
☆10Jul 8, 2020Updated 6 years ago
nethermanpro / ComSL
View on GitHub
☆11Oct 14, 2023Updated 2 years ago
albanie / LearningGrimacesByWatchingTV
View on GitHub
Code to accompany the paper "Learning Grimaces By Watching TV" and FaceValue dataset
☆12Aug 4, 2018Updated 7 years ago
groadabike / Kaldi-Dsing-task
View on GitHub
DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.
☆19Jul 9, 2026Updated 2 weeks ago
csukuangfj / kaldifeat
View on GitHub
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…
☆215Jul 10, 2026Updated 2 weeks ago
wenet-e2e / WeSpeech-AI
View on GitHub
Open Source Speech/Text Data on AI
☆19Sep 13, 2022Updated 3 years ago
k2-fsa / fast_rnnt
View on GitHub
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆149Aug 25, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
csukuangfj / icefall
View on GitHub
☆11Jul 16, 2026Updated last week
winlinvip / ai-translation
View on GitHub
This solution is not good enough, we're researching a better version: https://github.com/winlinvip/vod-translator so we archive this repo…
☆21Apr 17, 2024Updated 2 years ago
csukuangfj / optimized_transducer
View on GitHub
Memory efficient transducer loss computation
☆70Jun 10, 2022Updated 4 years ago
Enescigdem / SignLanguageRecognizer
View on GitHub
☆16Nov 8, 2020Updated 5 years ago
jimbozhang / yesno-example-for-undergraduates
View on GitHub
☆30Nov 17, 2022Updated 3 years ago
thu-spmi / CTC-TTS
View on GitHub
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20Jun 9, 2026Updated last month
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
k2-fsa / multi_quantization
View on GitHub
☆46Nov 2, 2023Updated 2 years ago
WangHelin1997 / SpecAugment-plus
View on GitHub
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆34Jun 25, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
PINTO0309 / sne4onnx
View on GitHub
A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…
☆17Feb 24, 2026Updated 5 months ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
inclusionAI / Ming-Freeform-Audio-Edit
View on GitHub
☆15Oct 27, 2025Updated 8 months ago
mfischer-ucl / metappearance
View on GitHub
Metappearance: Meta-Learning for Visual Appearance Reproduction
☆22Sep 19, 2022Updated 3 years ago
k2-fsa / next-gen-kaldi-wechat
View on GitHub
☆40Oct 16, 2025Updated 9 months ago
awni / automata_ml
View on GitHub
An Introduction to Weighted Automata in Machine Learning
☆64Sep 3, 2022Updated 3 years ago
luomingshuang / M3GPT
View on GitHub
M3GPT: An advanced multimodal, multitask framework for motion comprehension and generation.
☆23Dec 12, 2024Updated last year
ronggong / mispronunciation-detection
View on GitHub
Mispronunciation detection code for jingju singing voice
☆19Sep 5, 2018Updated 7 years ago
jctian98 / e2e_lfmmi
View on GitHub
E2E system with LF-MMI; word N-gram for Mandarin
☆167Apr 29, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
EdVince / llm-cpp
View on GitHub
☆34Jul 23, 2024Updated 2 years ago
jiangzhiyu1016 / Android-Offline-Speech-Recognition
View on GitHub
Provide accurate offline voice-to-text services for VR,AR and Android platforms, such as oculus quest1/2/pro or pico3/4
☆26May 21, 2024Updated 2 years ago
TeaPoly / Conformer-Athena
View on GitHub
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆44Nov 2, 2022Updated 3 years ago
JuanFMontesinos / Acappella-YNet
View on GitHub
Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21
☆18May 14, 2022Updated 4 years ago
leto19 / WhiSQA
View on GitHub
Whisper Speech Quality Assessment (WhiSQA)
☆16Apr 14, 2026Updated 3 months ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago