A song aesthetic evaluation toolkit trained on SongEval.
☆301 · Apr 8, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for SongEval
Users who are interested in SongEval are comparing it to the repositories listed below.
- Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization". ☆330 · Aug 4, 2025 · Updated 8 months ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU. ☆488 · Nov 23, 2025 · Updated 5 months ago
- A Massive Contextual Speech Recognition Benchmark. ☆105 · Aug 6, 2025 · Updated 8 months ago
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024] ☆47 · Jan 23, 2025 · Updated last year
- ☆131 · Oct 16, 2025 · Updated 6 months ago
- A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations ☆143 · Feb 6, 2026 · Updated 2 months ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte… ☆17 · Mar 3, 2025 · Updated last year
- An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators. ☆240 · Feb 26, 2026 · Updated 2 months ago
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion ☆2,289 · Nov 27, 2025 · Updated 5 months ago
- A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows ☆266 · Jan 8, 2026 · Updated 3 months ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025] ☆234 · May 11, 2025 · Updated 11 months ago
- Audio-FLAN ☆160 · Sep 23, 2025 · Updated 7 months ago
- Unified automatic quality assessment for speech, music, and sound. ☆708 · Jun 5, 2025 · Updated 10 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆13 · Mar 11, 2025 · Updated last year
- ☆158 · Nov 22, 2024 · Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching ☆45 · Feb 9, 2025 · Updated last year
- Llasa Speed Up ☆62 · Jan 18, 2026 · Updated 3 months ago
- Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching ☆152 · Nov 9, 2025 · Updated 5 months ago
- Official repository for the WenetSpeech-Chuan dataset. ☆176 · Feb 5, 2026 · Updated 2 months ago
- [NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix ☆205 · Feb 25, 2026 · Updated 2 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding ☆51 · May 1, 2025 · Updated 11 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS ☆51 · Jul 14, 2024 · Updated last year
- [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s… ☆154 · May 30, 2025 · Updated 11 months ago
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model ☆297 · Oct 12, 2025 · Updated 6 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆46 · Mar 10, 2025 · Updated last year
- State-of-the-art pretrained music models for training, evaluation, inference ☆174 · Jan 20, 2026 · Updated 3 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline. ☆17 · May 9, 2025 · Updated 11 months ago
- ☆36 · Sep 6, 2025 · Updated 7 months ago
- A Singing Style Conversion Framework Based On Audio Infilling ☆33 · Apr 28, 2025 · Updated last year
- ☆76 · Sep 13, 2024 · Updated last year
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training". ☆456 · May 25, 2025 · Updated 11 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks ☆48 · May 24, 2025 · Updated 11 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆35 · Apr 22, 2024 · Updated 2 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931) ☆71 · Dec 23, 2025 · Updated 4 months ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024] ☆57 · Nov 10, 2025 · Updated 5 months ago
- Official code for SongEcho ☆59 · Mar 3, 2026 · Updated last month
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp. ☆230 · Apr 8, 2026 · Updated 3 weeks ago
- ☆15 · Apr 16, 2026 · Updated 2 weeks ago
- A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding. ☆425 · Feb 12, 2026 · Updated 2 months ago