thu-spmi / ASR-BenchmarksLinks

An effort to track benchmarking results over widely-used datasets for ASR.

☆47

Alternatives and similar repositories for ASR-Benchmarks

Users that are interested in ASR-Benchmarks are comparing it to the libraries listed below

Sorting:

TeaPoly / Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆44Updated 2 years ago
tencent-ailab / 3m-asr
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
☆118Updated 3 years ago
k2-fsa / fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆143Updated last year
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48Updated 7 months ago
HaoranMiao / streaming-attention
streaming attention networks for end-to-end automatic speech recognition
☆55Updated 5 years ago
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆61Updated 4 years ago
BUTSpeechFIT / AMI-diarization-setup
☆54Updated last year
hainan-xv / PASM
Pronunciation-assisted Subword Modeling
☆30Updated 6 years ago
R1ckShi / AESRC2020
[ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆55Updated 4 years ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆108Updated 3 years ago
k2-fsa / text_search
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
☆72Updated last month
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆81Updated last year
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆73Updated 4 years ago
ga642381 / Speech-Prompts-Adapters
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
☆110Updated 2 years ago
yufan-aslp / AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆122Updated 3 years ago
desh2608 / dover-lap
Python package for combining diarization system outputs.
☆88Updated last year
csukuangfj / kaldilm
Python wrapper for kaldi's arpa2fst
☆38Updated 8 months ago
athena-team / athena-decoder
☆76Updated 3 years ago
TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆58Updated last year
idiap / acoustic-simulator
Implementation of audio degradation processes
☆103Updated 9 years ago
thuhcsi / FlatTN
Chinese Text Normalization and Dataset
☆84Updated 3 years ago
hbredin / DomainAdversarialVoiceActivityDetection
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆23Updated 5 years ago
k2-fsa / multi_quantization
☆44Updated last year
NaoyukiKanda / LibriSpeechMix
☆36Updated 4 years ago
FFSVC / FFSVC2022_Baseline_System
☆32Updated 2 years ago
csukuangfj / optimized_transducer
Memory efficient transducer loss computation
☆68Updated 3 years ago
MarkWuNLP / SemanticMask
The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"
☆38Updated 5 years ago
csukuangfj / transducer-loss-benchmarking
☆68Updated 3 years ago
mycrazycracy / speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45Updated 6 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆41Updated 2 years ago