Repository for MIDI-GPT, a controllable multi-track music machine.
☆62Sep 30, 2025Updated 5 months ago
Alternatives and similar repositories for MIDI-GPT
Users that are interested in MIDI-GPT are comparing it to the libraries listed below
Sorting:
- ☆78Feb 6, 2026Updated 3 weeks ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆46May 24, 2025Updated 9 months ago
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆63Feb 19, 2025Updated last year
- MAX/MSP objects for audio and rhythmic synthesis using networks of coupled oscillators☆13May 5, 2023Updated 2 years ago
- ☆54Jul 16, 2025Updated 7 months ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆76Jun 19, 2025Updated 8 months ago
- Diffusion Network for MIDI Transformation☆15Jul 4, 2025Updated 7 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆17May 20, 2025Updated 9 months ago
- A Singing Style Conversion Framework Based On Audio Infilling☆33Apr 28, 2025Updated 10 months ago
- This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without…☆50Feb 4, 2026Updated 3 weeks ago
- ☆20Oct 27, 2025Updated 4 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 10 months ago
- A constrainable continuator. Applications to music (mostly), text, chords, etc.☆33Jan 15, 2026Updated last month
- Full-attention multi-instrumental music transformer featuring asymmetrical encoding with octo-velocity, and chords counters tokens, optim…☆48Dec 15, 2023Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Jan 27, 2025Updated last year
- A library for computing Frechet Music Distance.☆28Feb 4, 2025Updated last year
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago
- ☆26Feb 17, 2026Updated 2 weeks ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- Basic library for spatial audio SOFA files☆12Sep 29, 2020Updated 5 years ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Sep 13, 2024Updated last year
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 7 months ago
- "Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures"☆48Aug 23, 2025Updated 6 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆111Apr 1, 2024Updated last year
- Companion repository which facilitates the creation of Gradio endpoints which are accessible from within Digital Audio Workstations (DAWs…☆28Sep 13, 2025Updated 5 months ago
- Low-latency timbre transfer models for instrumental interaction.☆92Oct 10, 2025Updated 4 months ago
- Samples for dirt/superdirt ripped from my Roland JV-1080 digital synth☆13Dec 1, 2016Updated 9 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- ☆22Jul 30, 2025Updated 7 months ago
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆17Mar 31, 2025Updated 11 months ago
- Flexible LoRA Implementation to use with stable-audio-tools☆80Sep 9, 2024Updated last year
- Official repository for GraFPrint: an audio identification framework based on graph neural networks.☆37Sep 18, 2025Updated 5 months ago
- Utu is a command-line program that uses the Loris library to analyze sounds.☆16Oct 11, 2022Updated 3 years ago
- Machine learning-powered music generation. Full-featured tokenizer, customization options, and high-quality output files.☆14Feb 3, 2025Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆64Nov 5, 2025Updated 3 months ago
- ☆32Nov 25, 2023Updated 2 years ago
- Composer's Assistant for REAPER☆64Jun 16, 2025Updated 8 months ago