XARES-LLM
☆54Mar 26, 2026Updated this week
Alternatives and similar repositories for xares-llm
Users that are interested in xares-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A benchmark for evaluating audio encoders on various audio tasks.☆49Dec 11, 2025Updated 3 months ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆76Jun 16, 2025Updated 9 months ago
- Official Implementation of GLAP - General Language Audio Pretraining☆66Updated this week
- Template for creating audio encoders compatible with X-ARES☆19Feb 11, 2026Updated last month
- 🤗 R1-AQA Model: mispeech/r1-aqa☆318Mar 28, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- 🗣️ Convert between phonetic alphabets☆11Feb 7, 2022Updated 4 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆38Feb 5, 2026Updated last month
- Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.☆33Nov 12, 2025Updated 4 months ago
- The IoSR listening room multichannel BRIR dataset contains binaural room impulse responses measured at head angles of 0 to 360 degrees in…☆22Mar 24, 2017Updated 9 years ago
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆131Sep 2, 2025Updated 6 months ago
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…☆44Mar 19, 2026Updated last week
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆88Feb 3, 2026Updated last month
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ICSD Dataset☆41Jun 11, 2025Updated 9 months ago
- ☆30Jan 22, 2026Updated 2 months ago
- ☆12Jul 6, 2023Updated 2 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- ☆117Updated this week
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Accompanying code for our paper "Point Cloud Audio Processing"☆18Jul 1, 2021Updated 4 years ago
- Speech emotion recognition using LSTM, SVM and MLP | 语音情感识别☆10Jul 1, 2019Updated 6 years ago
- SongDriver2 achieves a balance between real-time emotion fit and soft transitions, enhancing the coherence of the generated music.☆11Nov 15, 2025Updated 4 months ago
- ☆15Nov 10, 2025Updated 4 months ago
- This project is the official implementation of ``Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation'' in PyTorch, wh…☆12Nov 4, 2022Updated 3 years ago
- Code release for Unsupervised Domain Adaptation via Distilled Discriminative Clustering published by Pattern Recognition in 2022☆11May 19, 2023Updated 2 years ago
- the code for 'Global HRTF Personalization Using Anthropometric Measures'(AES 150th convention)☆35Jul 24, 2022Updated 3 years ago
- ☆19Aug 16, 2025Updated 7 months ago
- ☆17Jul 14, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆12Oct 15, 2024Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆50Oct 23, 2025Updated 5 months ago
- ☆11Nov 18, 2021Updated 4 years ago
- Audio Processing & Visualization Concepts☆12Jun 20, 2023Updated 2 years ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 5 years ago
- ☆11Feb 14, 2025Updated last year