huggingface / open_asr_leaderboardView external linksLinks
☆174Updated this week
Alternatives and similar repositories for open_asr_leaderboard
Users that are interested in open_asr_leaderboard are comparing it to the libraries listed below
Sorting:
- ☆323Jun 14, 2024Updated last year
- Various speech datasets made available to the public☆130Dec 13, 2024Updated last year
- ☆18Sep 19, 2023Updated 2 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆16Oct 14, 2025Updated 4 months ago
- ☆20Jan 21, 2026Updated 3 weeks ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15May 16, 2025Updated 8 months ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆112Aug 4, 2023Updated 2 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆105Jan 10, 2025Updated last year
- Large Language Model Text Generation Inference on Habana Gaudi☆34Mar 20, 2025Updated 10 months ago
- ☆24Jan 14, 2021Updated 5 years ago
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated 7 months ago
- ☆53Oct 17, 2023Updated 2 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 2 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Russian words synonyms and antonyms☆11Dec 7, 2021Updated 4 years ago
- ☆40May 4, 2024Updated last year
- A lightweight library to compute Diarization Error Rate (DER).☆62Jan 14, 2026Updated last month
- Acoustic Neighbor Embeddings☆29Jul 13, 2025Updated 7 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Oct 18, 2023Updated 2 years ago
- ☆11Jun 22, 2023Updated 2 years ago
- A toolkit for processing speech data and creating speech datasets☆200Feb 6, 2026Updated last week
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Oct 19, 2022Updated 3 years ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆147May 18, 2025Updated 8 months ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated last year
- This repository contains the SpeechBrain Benchmarks☆137Feb 3, 2026Updated last week
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,305Nov 19, 2025Updated 2 months ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆467Jul 13, 2023Updated 2 years ago
- ☆11Nov 7, 2024Updated last year
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 7 months ago
- ☆12Apr 26, 2025Updated 9 months ago
- Open-source reproducible benchmarks from Argmax☆77Jan 19, 2026Updated 3 weeks ago
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- Getting confidences from any end-to-end systems☆11May 24, 2023Updated 2 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆45May 13, 2025Updated 9 months ago
- multilingual speech aligner☆76Nov 19, 2023Updated 2 years ago