xiaomi-research/xares-llm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xiaomi-research/xares-llm)

xiaomi-research / xares-llm

XARES-LLM

☆54

Alternatives and similar repositories for xares-llm

Users that are interested in xares-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jimbozhang / xares
View on GitHub
A benchmark for evaluating audio encoders on various audio tasks.
☆55Apr 27, 2026Updated 2 months ago
xiaomi-research / dasheng-denoiser
View on GitHub
Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…
☆81Jun 16, 2025Updated last year
xiaomi-research / dasheng-glap
View on GitHub
Official Implementation of GLAP - General Language Audio Pretraining
☆72May 14, 2026Updated last month
xiaomi-research / mecat
View on GitHub
☆42May 12, 2026Updated last month
jimbozhang / xares-llm-template
View on GitHub
Template for creating audio encoders compatible with X-ARES
☆19Feb 11, 2026Updated 4 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
xiaomi-research / r1-aqa
View on GitHub
🤗 R1-AQA Model: mispeech/r1-aqa
☆325Mar 28, 2025Updated last year
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
rossellhayes / ipa
View on GitHub
🗣️ Convert between phonetic alphabets
☆11Feb 7, 2022Updated 4 years ago
frank613 / CTC-based-GOP
View on GitHub
This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024
☆41Feb 5, 2026Updated 5 months ago
urgent-challenge / urgent2026_challenge_track1
View on GitHub
Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.
☆36Nov 12, 2025Updated 7 months ago
IoSR-Surrey / IoSR_ListeningRoom_BRIRs
View on GitHub
The IoSR listening room multichannel BRIR dataset contains binaural room impulse responses measured at head angles of 0 to 360 degrees in…
☆22Mar 24, 2017Updated 9 years ago
nicolaus625 / CMI-bench
View on GitHub
☆18Jun 24, 2025Updated last year
xiquan-li / MeanAudio
View on GitHub
[ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
☆141Sep 2, 2025Updated 10 months ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
bagustris / ssl-ser
View on GitHub
Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"
☆10Mar 15, 2023Updated 3 years ago
frothywater / kanade-tokenizer
View on GitHub
Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…
☆102Jun 19, 2026Updated 2 weeks ago
QingyuLiu0521 / ICSD
View on GitHub
ICSD Dataset
☆42Jun 11, 2025Updated last year
kyutai-labs / tts_longeval
View on GitHub
☆30Apr 29, 2026Updated 2 months ago
tongshuangwu / llm-crowdsourcing-pipeline
View on GitHub
☆11Jul 6, 2023Updated 3 years ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
LudovicTuncay / Audio-JEPA
View on GitHub
Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…
☆62Apr 17, 2026Updated 2 months ago
semanticVAD / testsets
View on GitHub
Testing sets for semanticVAD
☆20Feb 18, 2025Updated last year
uiuctml / GOAT
View on GitHub
[JMLR] Gradual Domain Adaptation: Theory and Algorithms
☆11Jan 14, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
etzinis / optimal_condition_training
View on GitHub
Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…
☆14Feb 15, 2023Updated 3 years ago
nlp-waseda / traveling-across-languages
View on GitHub
Official repo and evaluation implementation of KnowRecall and VisRecall
☆10May 22, 2025Updated last year
qiuqiangkong / audioflow
View on GitHub
☆126Updated this week
SubramaniKrishna / point-cloud-audio
View on GitHub
Accompanying code for our paper "Point Cloud Audio Processing"
☆18Jul 1, 2021Updated 5 years ago
aascode / Speech-Emotion-Recognition-2
View on GitHub
Speech emotion recognition using LSTM, SVM and MLP | 语音情感识别
☆10Jul 1, 2019Updated 7 years ago
CarlWangChina / REMAST-Real-time-Emotion-based-Music-Arrangement-with-Soft-Transition
View on GitHub
SongDriver2 achieves a balance between real-time emotion fit and soft transitions, enhancing the coherence of the generated music.
☆11Nov 15, 2025Updated 7 months ago
lourson1091 / audiobertscore
View on GitHub
☆15Nov 10, 2025Updated 7 months ago
a791702141 / SSG
View on GitHub
This project is the official implementation of ``Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation'' in PyTorch, wh…
☆12Nov 4, 2022Updated 3 years ago
MuSAELab / AUDDT
View on GitHub
A toolkit for benchmarking on a wide variety of audio deepfake datasets.
☆31May 22, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
huitangtang / DisClusterDA
View on GitHub
Code release for Unsupervised Domain Adaptation via Distilled Discriminative Clustering published by Pattern Recognition in 2022
☆11May 19, 2023Updated 3 years ago
YuriWayne42 / hrtf_sht_personalization
View on GitHub
the code for 'Global HRTF Personalization Using Anthropometric Measures'(AES 150th convention)
☆36Jul 24, 2022Updated 3 years ago
nttcslab / dcase2025_task4_baseline
View on GitHub
☆18Apr 16, 2026Updated 2 months ago
iOPENCap / awesome-unimodal-training
View on GitHub
text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)
☆12Oct 15, 2024Updated last year
FinvDialect / 2023_finvcup_baseline
View on GitHub
☆17Jul 14, 2023Updated 2 years ago
wsntxxn / TextToAudioGrounding
View on GitHub
The dataset and baseline code for Text-to-Audio Grounding (TAG)
☆49Oct 23, 2025Updated 8 months ago
bhuvanakundumani / SimCSE_unsupervised
View on GitHub
☆10Nov 18, 2021Updated 4 years ago