XARES-LLM
☆55Mar 26, 2026Updated last month
Alternatives and similar repositories for xares-llm
Users who are interested in xares-llm are comparing it to the libraries listed below.
- A benchmark for evaluating audio encoders on various audio tasks.☆53Apr 27, 2026Updated last week
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆78Jun 16, 2025Updated 10 months ago
- Official Implementation of GLAP - General Language Audio Pretraining☆70Mar 25, 2026Updated last month
- Template for creating audio encoders compatible with X-ARES☆19Feb 11, 2026Updated 2 months ago
- 🤗 R1-AQA Model: mispeech/r1-aqa☆323Mar 28, 2025Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- 🗣️ Convert between phonetic alphabets☆11Feb 7, 2022Updated 4 years ago
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024☆39Feb 5, 2026Updated 3 months ago
- Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.☆35Nov 12, 2025Updated 5 months ago
- The IoSR listening room multichannel BRIR dataset contains binaural room impulse responses measured at head angles of 0 to 360 degrees in…☆22Mar 24, 2017Updated 9 years ago
- ☆18Jun 24, 2025Updated 10 months ago
- [ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆133Sep 2, 2025Updated 8 months ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆95Apr 3, 2026Updated last month
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…☆52Apr 17, 2026Updated 3 weeks ago
- ICSD Dataset☆41Jun 11, 2025Updated 10 months ago
- ☆30Apr 29, 2026Updated last week
- ☆12Jul 6, 2023Updated 2 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model"☆15Jun 28, 2024Updated last year
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- Official repo and evaluation implementation of KnowRecall and VisRecall☆10May 22, 2025Updated 11 months ago
- ☆120Updated this week
- Accompanying code for our paper "Point Cloud Audio Processing"☆18Jul 1, 2021Updated 4 years ago
- Speech emotion recognition using LSTM, SVM and MLP | 语音情感识别☆10Jul 1, 2019Updated 6 years ago
- SongDriver2 achieves a balance between real-time emotion fit and soft transitions, enhancing the coherence of the generated music.☆11Nov 15, 2025Updated 5 months ago
- ☆15Nov 10, 2025Updated 5 months ago
- This project is the official implementation of ``Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation'' in PyTorch, wh…☆12Nov 4, 2022Updated 3 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆30Apr 20, 2026Updated 2 weeks ago
- Code release for Unsupervised Domain Adaptation via Distilled Discriminative Clustering published by Pattern Recognition in 2022☆11May 19, 2023Updated 2 years ago
- The code for 'Global HRTF Personalization Using Anthropometric Measures' (AES 150th Convention)☆35Jul 24, 2022Updated 3 years ago
- ☆17Apr 16, 2026Updated 3 weeks ago
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆12Oct 15, 2024Updated last year
- ☆17Jul 14, 2023Updated 2 years ago
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆50Oct 23, 2025Updated 6 months ago
- ☆10Nov 18, 2021Updated 4 years ago
- Audio Processing & Visualization Concepts☆12Jun 20, 2023Updated 2 years ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago