A song aesthetic evaluation toolkit trained on SongEval.
☆288Jun 15, 2025Updated 9 months ago
Alternatives and similar repositories for SongEval
Users that are interested in SongEval are comparing it to the libraries listed below
Sorting:
- Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".☆317Aug 4, 2025Updated 7 months ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆482Nov 23, 2025Updated 3 months ago
- A Massive Contextual Speech Recognition Benchmark.☆104Aug 6, 2025Updated 7 months ago
- ☆110Oct 16, 2025Updated 5 months ago
- A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations☆117Feb 6, 2026Updated last month
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆46Jan 23, 2025Updated last year
- An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.☆227Feb 26, 2026Updated 3 weeks ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆17Mar 3, 2025Updated last year
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion☆2,268Nov 27, 2025Updated 3 months ago
- A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows☆232Jan 8, 2026Updated 2 months ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]☆229May 11, 2025Updated 10 months ago
- Audio-FLAN☆160Sep 23, 2025Updated 5 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated last year
- ☆156Nov 22, 2024Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Llasa Speed Up☆61Jan 18, 2026Updated 2 months ago
- Unified automatic quality assessment for speech, music, and sound.☆694Jun 5, 2025Updated 9 months ago
- Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching☆151Nov 9, 2025Updated 4 months ago
- Official repository for the WenetSpeech-Chuan dataset.☆164Feb 5, 2026Updated last month
- [NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix☆199Feb 25, 2026Updated 3 weeks ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 10 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Jul 14, 2024Updated last year
- [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s…☆153May 30, 2025Updated 9 months ago
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆295Oct 12, 2025Updated 5 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆46Mar 10, 2025Updated last year
- State-of-the-art pretrained music models for training, evaluation, inference☆166Jan 20, 2026Updated 2 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 10 months ago
- ☆36Sep 6, 2025Updated 6 months ago
- A Singing Style Conversion Framework Based On Audio Infilling☆33Apr 28, 2025Updated 10 months ago
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".☆438May 25, 2025Updated 9 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆47May 24, 2025Updated 9 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆33Apr 22, 2024Updated last year
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆68Dec 23, 2025Updated 2 months ago
- This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-…☆29Feb 8, 2026Updated last month
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆58Nov 10, 2025Updated 4 months ago
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.☆225Aug 6, 2025Updated 7 months ago
- Official code for SongEcho☆52Mar 3, 2026Updated 2 weeks ago
- ☆15Aug 22, 2025Updated 6 months ago
- A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.☆422Feb 12, 2026Updated last month