A song aesthetic evaluation toolkit trained on SongEval.
☆295Jun 15, 2025Updated 9 months ago
Alternatives and similar repositories for SongEval
Users that are interested in SongEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".☆323Aug 4, 2025Updated 8 months ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆483Nov 23, 2025Updated 4 months ago
- A Massive Contextual Speech Recognition Benchmark.☆105Aug 6, 2025Updated 8 months ago
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆46Jan 23, 2025Updated last year
- ☆121Oct 16, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations☆126Feb 6, 2026Updated 2 months ago
- An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.☆235Feb 26, 2026Updated last month
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆17Mar 3, 2025Updated last year
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion☆2,277Nov 27, 2025Updated 4 months ago
- A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows☆246Jan 8, 2026Updated 3 months ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]☆231May 11, 2025Updated 10 months ago
- Audio-FLAN☆159Sep 23, 2025Updated 6 months ago
- Unified automatic quality assessment for speech, music, and sound.☆701Jun 5, 2025Updated 10 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆13Mar 11, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆157Nov 22, 2024Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Llasa Speed Up☆62Jan 18, 2026Updated 2 months ago
- Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching☆151Nov 9, 2025Updated 5 months ago
- Official repository for the WenetSpeech-Chuan dataset.☆169Feb 5, 2026Updated 2 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 11 months ago
- [NeurIPS 2025] Benchmark data and code for MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix☆202Feb 25, 2026Updated last month
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Jul 14, 2024Updated last year
- [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s…☆154May 30, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆296Oct 12, 2025Updated 5 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆46Mar 10, 2025Updated last year
- State-of-the-art pretrained music models for training, evaluation, inference☆171Jan 20, 2026Updated 2 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 11 months ago
- ☆36Sep 6, 2025Updated 7 months ago
- A Singing Style Conversion Framework Based On Audio Infilling☆33Apr 28, 2025Updated 11 months ago
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".☆450May 25, 2025Updated 10 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆47May 24, 2025Updated 10 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆35Apr 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆70Dec 23, 2025Updated 3 months ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆57Nov 10, 2025Updated 5 months ago
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.☆230Updated this week
- Official code for SongEcho☆55Mar 3, 2026Updated last month
- ☆15Aug 22, 2025Updated 7 months ago
- A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.☆424Feb 12, 2026Updated last month
- Robust Singing Voice Transcription and MIDI Extraction☆117Nov 20, 2024Updated last year