Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.04729)
☆55Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for JEN-1-pytorch
Users that are interested in JEN-1-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆32Jan 19, 2024Updated 2 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆57Oct 31, 2023Updated 2 years ago
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆29Dec 19, 2024Updated last year
- ☆87Oct 20, 2024Updated last year
- Official source codes of airsep☆39Mar 26, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆35Apr 22, 2024Updated last year
- This is the official repository for M2UGen☆514Jan 2, 2025Updated last year
- Mustango: Toward Controllable Text-to-Music Generation☆385Jun 2, 2025Updated 10 months ago
- Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks"☆20Aug 18, 2025Updated 8 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆48Sep 11, 2024Updated last year
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆16Jul 23, 2024Updated last year
- MU-LLaMA: Music Understanding Large Language Model☆305Aug 18, 2025Updated 8 months ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆113Aug 12, 2023Updated 2 years ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".☆24Dec 12, 2022Updated 3 years ago
- million song dataset split for extended clean tag & artist-level stratified☆52Aug 12, 2023Updated 2 years ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆107Jan 14, 2026Updated 3 months ago
- ISMIR 24 Supplementary Material☆14Oct 28, 2024Updated last year
- Flexible LoRA Implementation to use with stable-audio-tools☆82Sep 9, 2024Updated last year
- Unofficial download repository for MusicCaps☆47Apr 21, 2023Updated 2 years ago
- ☆32Nov 25, 2023Updated 2 years ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- The latent diffusion model for text-to-music generation.☆186Jan 26, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆193May 29, 2024Updated last year
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆77Jan 25, 2026Updated 2 months ago
- music generation with masked transformers!☆350May 16, 2025Updated 11 months ago
- The open source code for LLM-Codec☆146Aug 18, 2024Updated last year
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆348Apr 8, 2024Updated 2 years ago
- Official Implementation of EnCLAP (ICASSP 2024)☆95Jun 2, 2024Updated last year
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆27Apr 23, 2024Updated last year
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Apr 11, 2021Updated 5 years ago
- Llambada: Simple Text Controllable for accompaniment generation☆42Mar 14, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆45Jun 11, 2024Updated last year
- Frechet Audio Distance evaluation in PyTorch☆36Jun 9, 2023Updated 2 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Apr 27, 2023Updated 2 years ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27May 20, 2025Updated 10 months ago
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]☆43Oct 7, 2024Updated last year
- Where is the "main theme" in an orchestral score?☆14Oct 25, 2025Updated 5 months ago