0417keito/JEN-1-pytorch


Unofficial implementation of JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models (https://arxiv.org/abs/2308.04729)

☆55

Alternatives and similar repositories for JEN-1-pytorch

Users who are interested in JEN-1-pytorch are comparing it to the libraries listed below.


0417keito / JEN-1-COMPOSER-pytorch
Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.1…
☆32 · Jan 19, 2024 · Updated 2 years ago
asappresearch / simple-tts
Contains the code associated with the ICLR submission for our text-to-speech diffusion model
☆57 · Oct 31, 2023 · Updated 2 years ago
Kikyo-16 / airgen
Official source code of airsep
☆39 · Mar 26, 2024 · Updated 2 years ago
YatingMusic / MusiConGen
☆88 · Oct 20, 2024 · Updated last year
chenjianyi / fastsag
FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation
☆29 · Dec 19, 2024 · Updated last year
york135 / MIRMLPop
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35 · Apr 22, 2024 · Updated 2 years ago
shansongliu / MuMu-LLaMA
This is the official repository for M2UGen
☆513 · Jan 2, 2025 · Updated last year
AMAAI-Lab / mustango
Mustango: Toward Controllable Text-to-Music Generation
☆394 · Jun 2, 2025 · Updated last year
ldzhangyx / MusicMagus
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆49 · Sep 11, 2024 · Updated last year
yongyizang / AreYouReallyListening
Official Repository for ISMIR 2025 paper "Are you really listening? Boosting Perceptual Awareness in Music-QA Benchmarks"
☆20 · Aug 18, 2025 · Updated 11 months ago
yzGuu830 / efficient-speech-codec
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
☆126 · Mar 20, 2025 · Updated last year
shansongliu / MU-LLaMA
MU-LLaMA: Music Understanding Large Language Model
☆306 · Aug 18, 2025 · Updated 11 months ago
seungheondoh / music-text-representation
Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]
☆113 · Aug 12, 2023 · Updated 2 years ago
zcli-charlie / ZIQI-Eval
ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models
☆18 · Jul 23, 2024 · Updated 2 years ago
ldzhangyx / BART-fusion
The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".
☆24 · Dec 12, 2022 · Updated 3 years ago
seungheondoh / msd-subsets
Million Song Dataset split for extended clean tags and artist-level stratification
☆52 · Aug 12, 2023 · Updated 2 years ago
ismir-24-sub / unsupervised_compositional_representations
ISMIR 24 Supplementary Material
☆14 · Oct 28, 2024 · Updated last year
ldzhangyx / instruct-MusicGen
The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…
☆109 · Jan 14, 2026 · Updated 6 months ago
seungheondoh / music_caps_dl
Unofficial download repository for MusicCaps
☆47 · Apr 21, 2023 · Updated 3 years ago
NeuralNotW0rk / LoRAW
Flexible LoRA Implementation to use with stable-audio-tools
☆84 · Sep 9, 2024 · Updated last year
Sreyan88 / CompA
Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
☆23 · Jul 10, 2024 · Updated 2 years ago
guozixunnicolas / FundamentalMusicEmbedding
☆32 · Nov 25, 2023 · Updated 2 years ago
bytedance / Make-An-Audio-2
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
☆197 · May 29, 2024 · Updated 2 years ago
RetroCirce / MusicLDM
The latent diffusion model for text-to-music generation.
☆188 · Jan 26, 2024 · Updated 2 years ago
hugofloresgarcia / vampnet
music generation with masked transformers!
☆357 · May 16, 2025 · Updated last year
yangdongchao / LLM-Codec
The open source code for LLM-Codec
☆147 · Aug 18, 2024 · Updated last year
seungheondoh / lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆348 · Apr 8, 2024 · Updated 2 years ago
jaeyeonkim99 / EnCLAP
Official Implementation of EnCLAP (ICASSP 2024)
☆96 · Jun 2, 2024 · Updated 2 years ago
xinshengwang / ICASSP2021_paper_list-VC
ICASSP 2021 accepted papers on voice conversion (VC)
☆18 · Apr 11, 2021 · Updated 5 years ago
haoheliu / SemantiCodec
☆45 · Jun 11, 2024 · Updated 2 years ago
ilaria-manco / mulap
Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)
☆47 · Dec 3, 2024 · Updated last year
drscotthawley / fad_pytorch
Frechet Audio Distance evaluation in PyTorch
☆36 · Jun 9, 2023 · Updated 3 years ago
seungheondoh / music-text-representation-pp
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]
☆43 · Oct 7, 2024 · Updated last year
Kinyugo / msanii
A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.
☆196 · Apr 27, 2023 · Updated 3 years ago
fundwotsai2001 / Text-to-music-dataset-preparation
A repo that builds text-to-music datasets from scratch, used in MuseControlLite [ICML2025]
☆28 · May 20, 2025 · Updated last year
nii-yamagishilab / midi-to-audio
Project for MIDI to Audio Synthesis
☆28 · Mar 13, 2023 · Updated 3 years ago
yoyolicoris / music-spectrogram-diffusion-pytorch
☆88 · Jan 29, 2023 · Updated 3 years ago
0nutation / USLM
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024)
☆152 · Sep 14, 2023 · Updated 2 years ago
p0p4k / Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
☆71 · Nov 10, 2023 · Updated 2 years ago