JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
☆154Aug 7, 2025Updated 7 months ago
Alternatives and similar repositories for jamify
Users that are interested in jamify are comparing it to the libraries listed below
Sorting:
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆46Jan 23, 2025Updated last year
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆308Nov 5, 2025Updated 4 months ago
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆295Oct 12, 2025Updated 5 months ago
- ☆156Nov 22, 2024Updated last year
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆50Jul 28, 2025Updated 7 months ago
- The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement☆762Dec 4, 2025Updated 3 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated last month
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 10 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 10 months ago
- ☆55Dec 24, 2025Updated 2 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆27Sep 12, 2024Updated last year
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆47May 24, 2025Updated 9 months ago
- MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection☆26May 29, 2025Updated 9 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards☆94Jan 11, 2026Updated 2 months ago
- Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".☆317Aug 4, 2025Updated 7 months ago
- Encode and decode audio samples to/from compressed latent representations!☆250Sep 19, 2025Updated 6 months ago
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆17Sep 25, 2025Updated 5 months ago
- Latent Space Sound Design Tool based on the VAE of stable-audio-open☆15Aug 23, 2024Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆39Feb 24, 2025Updated last year
- Official source codes of coco-mulla☆36Mar 21, 2024Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆43Jun 13, 2024Updated last year
- ☆39Apr 15, 2024Updated last year
- Mustango: Toward Controllable Text-to-Music Generation☆387Jun 2, 2025Updated 9 months ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆44Oct 30, 2025Updated 4 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"☆16Jan 19, 2024Updated 2 years ago
- ☆22Nov 25, 2025Updated 3 months ago
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]☆229May 11, 2025Updated 10 months ago
- Code and Dataset for <Quantitative Analysis of Melodic Similarity in Music Copyright Infringement Cases, ISMIR 2024>☆14Nov 12, 2024Updated last year
- The official Implementation of PeriodWave and PeriodWave-Turbo☆220Apr 14, 2025Updated 11 months ago
- Tunee is your AI Music Partner. Create songs by chatting, manage multiple projects visually, and turn your music into stunning cinematic …☆37Sep 17, 2025Updated 6 months ago
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆16Jul 23, 2024Updated last year
- Audio production style transfer with inference-time optimization☆49Nov 18, 2024Updated last year
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…☆154Dec 8, 2025Updated 3 months ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated last year
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models☆241Dec 18, 2025Updated 3 months ago
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion☆2,268Nov 27, 2025Updated 3 months ago