jimbozhang / xares-llm-templateLinks
Template for creating audio encoders compatible with X-ARES
☆11Updated this week
Alternatives and similar repositories for xares-llm-template
Users that are interested in xares-llm-template are comparing it to the libraries listed below
Sorting:
- A benchmark for evaluating audio encoders on various audio tasks.☆35Updated last month
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- ARCH: Audio Representations benCHmark☆52Updated last year
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆58Updated 2 months ago
- A toolkit dedicate for speech evaluation.☆24Updated last year
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆31Updated 2 years ago
- WavReward: Spoken Dialogue Models With Generalist Reward Evaluators☆54Updated 6 months ago
- Official Implementation of GLAP - General Language Audio Pretraining☆53Updated 5 months ago
- ☆16Updated 3 months ago
- Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"☆81Updated 2 months ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆67Updated 6 months ago
- Generation scripts for EARS-WHAM and EARS-Reverb☆41Updated 5 months ago
- ☆49Updated 8 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆46Updated 6 months ago
- Exploring Binary Classification Loss for Speaker Verification☆18Updated 2 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆43Updated 6 months ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated last year
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆60Updated 3 months ago
- Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"☆40Updated 5 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆76Updated 6 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- ☆58Updated last month
- (ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement☆77Updated 4 months ago
- ☆63Updated last year
- The baselines of ARC-Challenge-Interspeech2026☆36Updated last week
- ☆56Updated last year
- The open-source code of UniAudio2.0☆73Updated 3 months ago
- ☆37Updated 4 years ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆17Updated last year
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆57Updated 5 months ago