open-source Mandarian biased word dataset
☆14Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for Contextual-Biasing-Dataset
Users that are interested in Contextual-Biasing-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- ☆88Jul 31, 2025Updated 9 months ago
- ☆11Sep 26, 2022Updated 3 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆17May 5, 2024Updated last year
- ☆20Jun 13, 2025Updated 10 months ago
- Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasonin…☆40Apr 11, 2026Updated 3 weeks ago
- Accompanying code for paper "Attention-Based Contextual Language Model Adaptation for Speech Recognition", submitted to ACL 2021.☆14Jul 25, 2023Updated 2 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- Collection of papers about video-audio understanding☆25Dec 26, 2025Updated 4 months ago
- [EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…☆31Jul 11, 2025Updated 9 months ago
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Mar 31, 2026Updated last month
- ☆15Jul 4, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- ☆13Mar 30, 2023Updated 3 years ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆50May 14, 2025Updated 11 months ago
- wenet_LLM_from_ASLP☆15Nov 26, 2024Updated last year
- Test implementation of "Aligned Cross Entropy for Non-Autoregressive Machine Translation" https://arxiv.org/abs/2004.01655☆21Jul 25, 2024Updated last year
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated 11 months ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆22Apr 1, 2022Updated 4 years ago
- Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.☆17May 9, 2025Updated 11 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- content.rdf.u8.gz☆11Dec 15, 2020Updated 5 years ago
- ☆22Apr 26, 2026Updated last week
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆47Mar 3, 2025Updated last year
- Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"☆126Jul 15, 2025Updated 9 months ago
- SpEx+(tied) source code☆94Jul 6, 2023Updated 2 years ago
- PyTorch implementation of our CVPR2023 paper "OpenMix: Exploring Out-of-Distribution samples for Misclassification Detection"☆27Oct 16, 2023Updated 2 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated 2 years ago
- ☆15Apr 4, 2025Updated last year
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆79Jan 9, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Open repository of simulated Room Impulse Responses (RIR) accompanying the paper "Hearing Anywhere in Any Environment"☆75Aug 11, 2025Updated 8 months ago
- Pre-trained grapheme-to-phoneme (G2P) models☆26Jul 27, 2021Updated 4 years ago
- ☆31Jul 31, 2025Updated 9 months ago
- Faster distil-whisper transcription with CTranslate2☆14Jan 23, 2024Updated 2 years ago
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Apr 15, 2026Updated 2 weeks ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- ☆11Oct 20, 2022Updated 3 years ago