0417keito / JEN-1-pytorchView external linksLinks
Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.04729)
☆54Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for JEN-1-pytorch
Users that are interested in JEN-1-pytorch are comparing it to the libraries listed below
Sorting:
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆32Jan 19, 2024Updated 2 years ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆57Oct 31, 2023Updated 2 years ago
- Official source codes of airsep☆39Mar 26, 2024Updated last year
- ☆32Nov 25, 2023Updated 2 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆32Apr 22, 2024Updated last year
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆28Dec 19, 2024Updated last year
- ISMIR 24 Supplementary Material☆14Oct 28, 2024Updated last year
- The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".☆24Dec 12, 2022Updated 3 years ago
- code for "BEAT-ALIGNED SPECTROGRAM-TO-SEQUENCE GENERATION OF RHYTHM-GAME CHARTS" (ISMIR 2023 LBD)☆18Jan 29, 2024Updated 2 years ago
- Mustango: Toward Controllable Text-to-Music Generation☆388Jun 2, 2025Updated 8 months ago
- This is the official repository for M2UGen☆511Jan 2, 2025Updated last year
- Code and demo for paper: Zhao et al., "Q&A: Query-Based Representation Learning for Multi-Track Symbolic Music re-Arrangement," IJCAI 202…☆20May 2, 2024Updated last year
- Where is the "main theme" in an orchestral score?☆12Oct 25, 2025Updated 3 months ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- MU-LLaMA: Music Understanding Large Language Model☆302Aug 18, 2025Updated 5 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆75Jan 25, 2026Updated 3 weeks ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated 10 months ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆29Sep 11, 2025Updated 5 months ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆187May 29, 2024Updated last year
- Official Implementation of EnCLAP (ICASSP 2024)☆94Jun 2, 2024Updated last year
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- Llambada: Simple Text Controllable for accompaniment generation☆37Sep 24, 2025Updated 4 months ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27May 20, 2025Updated 8 months ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Apr 11, 2021Updated 4 years ago
- music generation with masked transformers!☆350May 16, 2025Updated 9 months ago
- ☆84Oct 20, 2024Updated last year
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]☆42Oct 7, 2024Updated last year
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆16Jul 23, 2024Updated last year
- An Open-source Gufeng Melody and Chord Dataset☆15May 10, 2023Updated 2 years ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Jul 14, 2024Updated last year
- The open source code for LLM-Codec☆145Aug 18, 2024Updated last year
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆114Aug 12, 2023Updated 2 years ago
- The latent diffusion model for text-to-music generation.☆185Jan 26, 2024Updated 2 years ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆51Jul 28, 2025Updated 6 months ago
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆343Apr 8, 2024Updated last year
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆21Mar 28, 2023Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆152Sep 14, 2023Updated 2 years ago