thuhcsi/Contextual-Biasing-Dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thuhcsi/Contextual-Biasing-Dataset)

thuhcsi / Contextual-Biasing-Dataset

open-source Mandarian biased word dataset

☆14

Alternatives and similar repositories for Contextual-Biasing-Dataset

Users that are interested in Contextual-Biasing-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BriansIDP / WhisperBiasing
View on GitHub
☆88Jul 31, 2025Updated 11 months ago
facebookresearch / fbai-speech
View on GitHub
Repo for the FB AI Speech team.
☆27Aug 24, 2021Updated 4 years ago
MagicHub-io / CSASR_Challenge
View on GitHub
☆11Sep 26, 2022Updated 3 years ago
tango4j / llm_speaker_tagging
View on GitHub
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16Jun 16, 2024Updated 2 years ago
stevenhillis / awesome-asr-contextualization
View on GitHub
A curated list of awesome papers on contextualizing E2E ASR outputs
☆81May 10, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Hypotheses-Paradise / UADF
View on GitHub
☆17May 5, 2024Updated 2 years ago
the-bird-F / GLM-Voice-RAG
View on GitHub
[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…
☆31Jul 11, 2025Updated last year
amazon-science / contextual-attention-nlm
View on GitHub
Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.
☆14Jul 25, 2023Updated 2 years ago
emonosuke / emoASR
View on GitHub
End-to-end MOdeling of ASR (Automatic Speech Recognition)
☆33Feb 16, 2023Updated 3 years ago
tzyll / ChineseHP
View on GitHub
Dataset for Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models in Interspeech 2024.
☆16Jul 4, 2024Updated 2 years ago
Mddct / WeUSM
View on GitHub
☆13Mar 30, 2023Updated 3 years ago
gengxuelong / wenet_LLM_from_ASLP
View on GitHub
wenet_LLM_from_ASLP
☆15Nov 26, 2024Updated last year
m3yrin / aligned-cross-entropy
View on GitHub
Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655
☆21Jul 25, 2024Updated last year
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
songweige / Dmoz-Dataset
View on GitHub
content.rdf.u8.gz
☆11Dec 15, 2020Updated 5 years ago
kjw11 / Speaker-Aware-CTC
View on GitHub
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22May 26, 2025Updated last year
DanielLin94144 / Test-time-adaptation-ASR-SUTA
View on GitHub
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…
☆23Apr 1, 2022Updated 4 years ago
flageval-baai / ChildMandarin
View on GitHub
[ACL 2025 Main] A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5
☆55Mar 19, 2025Updated last year
PRIS-CV / AutoDriveRL
View on GitHub
☆19Jun 13, 2025Updated last year
Impression2805 / OpenMix
View on GitHub
PyTorch implementation of our CVPR2023 paper "OpenMix: Exploring Out-of-Distribution samples for Misclassification Detection"
☆28Oct 16, 2023Updated 2 years ago
mubingshen / MLC-SLM-Baseline
View on GitHub
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆51May 14, 2025Updated last year
worldarena / WorldArena
View on GitHub
the official repository of the WorldArena benchmark
☆15Mar 23, 2026Updated 4 months ago
kehanlu / DeSTA2
View on GitHub
Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"
☆127Jul 15, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HeimingX / TAG
View on GitHub
Official code for Attention-driven GUI Grounding, AAAI2025
☆15Dec 17, 2024Updated last year
jymh / SAP2-ASR
View on GitHub
☆26Jan 23, 2026Updated 6 months ago
zxzhao0 / C2SER
View on GitHub
We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…
☆49Mar 3, 2025Updated last year
gemengtju / SpEx_Plus
View on GitHub
SpEx+(tied) source code
☆96Jul 6, 2023Updated 3 years ago
jonflynng / qwen2-audio-finetune
View on GitHub
Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.
☆24Nov 23, 2024Updated last year
maple-research-lab / RemeDi
View on GitHub
Official inference implementation of the paper "DON'T SETTLE TOO EARLY: SELF-REFLECTIVE REMASKING FOR DIFFUSION LANGUAGE MODELS". [ICLR 2…
☆15Jan 28, 2026Updated 5 months ago
maru0014 / AutoKitting
View on GitHub
PowerShell によって Windows10 のキッティングに必要な全工程を自動的に完了。
☆12Jun 10, 2025Updated last year
Qinying-Liu / Awesome-omni-modal-understanding
View on GitHub
Collection of papers about video-audio understanding
☆25Dec 26, 2025Updated 6 months ago
MingLunHan / CIF-PyTorch
View on GitHub
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…
☆78Jul 14, 2026Updated last week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Audio-Reasoning-Challenge / Audio-Reasoning-Challenge-Baselines
View on GitHub
The baselines of ARC-Challenge-Interspeech2026
☆60Dec 1, 2025Updated 7 months ago
petronny / g2p
View on GitHub
Pre-trained grapheme-to-phoneme (G2P) models
☆26Jul 27, 2021Updated 4 years ago
metame-ai / faster-distil-whisper
View on GitHub
Faster distil-whisper transcription with CTranslate2
☆14Jan 23, 2024Updated 2 years ago
kkatahira / cmbd-book
View on GitHub
「行動データの計算論モデリング」のサポートページです。
☆11Mar 1, 2021Updated 5 years ago
yoongi43 / MGE-LDM
View on GitHub
Official implementation of the paper MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction
☆20Feb 19, 2026Updated 5 months ago
R1ckShi / SeACo-Paraformer
View on GitHub
[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.
☆44Mar 15, 2024Updated 2 years ago
anthony-wss / glm-4-voice-finetune
View on GitHub
☆14Apr 4, 2025Updated last year