Open-source Mandarin biased word dataset
☆14Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for Contextual-Biasing-Dataset
Users that are interested in Contextual-Biasing-Dataset are comparing it to the libraries listed below
- ☆86Jul 31, 2025Updated 7 months ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- A curated list of awesome papers on contextualizing E2E ASR outputs☆80May 10, 2023Updated 2 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasonin…☆25Updated this week
- [EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…☆27Jul 11, 2025Updated 7 months ago
- ☆17May 5, 2024Updated last year
- ☆13Mar 30, 2023Updated 2 years ago
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Jan 15, 2026Updated last month
- ☆15Jul 4, 2024Updated last year
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- Automatically completes every step required for Windows 10 kitting (device setup) using PowerShell.☆12Jun 10, 2025Updated 8 months ago
- Detecting and correcting dysfluencies/stuttering/stammering in audio files☆10Apr 23, 2023Updated 2 years ago
- Speech Emotion Recognition using Deep Learning☆12May 24, 2021Updated 4 years ago
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Oct 30, 2025Updated 4 months ago
- Open repository of simulated Room Impulse Responses (RIR) accompanying the paper "Hearing Anywhere in Any Environment"☆70Aug 11, 2025Updated 6 months ago
- WavReward: Spoken Dialogue Models With Generalist Reward Evaluators☆54May 15, 2025Updated 9 months ago
- ☆11Oct 31, 2021Updated 4 years ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆50May 14, 2025Updated 9 months ago
- ☆10Oct 20, 2022Updated 3 years ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆22Feb 13, 2026Updated 2 weeks ago
- ☆53Dec 7, 2025Updated 2 months ago
- Support page for "Computational Modeling of Behavioral Data" (行動データの計算論モデリング).☆11Mar 1, 2021Updated 5 years ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆43Mar 3, 2025Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 8 months ago
- Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.☆17May 9, 2025Updated 9 months ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 4 months ago
- A centrality-based Chinese keyphrase extraction tool☆11Sep 2, 2022Updated 3 years ago
- ☆19Jul 22, 2025Updated 7 months ago
- content.rdf.u8.gz☆10Dec 15, 2020Updated 5 years ago
- ☆20Jun 13, 2025Updated 8 months ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Uyghur text resource crawled from website☆12Dec 25, 2015Updated 10 years ago
- ☆11Sep 26, 2022Updated 3 years ago
- A summary of some C++ fundamentals☆10Oct 28, 2020Updated 5 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago