SonyCSLParis / music2latentLinks
Encode and decode audio samples to/from compressed latent representations!
☆211Updated 3 months ago
Alternatives and similar repositories for music2latent
Users that are interested in music2latent are comparing it to the libraries listed below
Sorting:
- A simple library for Fréchet Audio Distance (FAD) calculation☆211Updated last week
- Self-supervised learning for fast pitch estimation☆232Updated 3 months ago
- ☆201Updated last year
- ☆166Updated last year
- Codes for ISMIR 2022 paper: Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention☆108Updated last year
- [PyTorch] Minimal codebase for MusicGen models☆60Updated 4 months ago
- open soundstream-ish VAE codecs for downstream neural audio synthesis☆117Updated last year
- Fine-tune Stable Audio Open with DiT ControlNet.☆225Updated 2 weeks ago
- Results and Models for Learning Audio Representations of Music Content☆98Updated 5 months ago
- A collection of useful audio datasets and transforms for PyTorch.☆140Updated 2 years ago
- Trainer for audio-diffusion-pytorch☆128Updated 2 years ago
- Pitch Estimating Neural Networks (PENN)☆253Updated 2 months ago
- A DDSP-based neural voice synthesiser.☆117Updated 6 months ago
- Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls☆81Updated 10 months ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆113Updated last year
- A lightweight library for Frechet Audio Distance calculation.☆273Updated 8 months ago
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…☆188Updated 10 months ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆150Updated last year
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆122Updated 3 months ago
- Unofficial download repository for MusicCaps☆47Updated 2 years ago
- ☆82Updated 2 years ago
- Code for paper: "Deep Embeddings and Section Fusion Improve Music Segmentation"☆53Updated 2 years ago
- Headless multitrack mixing console in Python☆118Updated 2 years ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆38Updated last year
- Full models and training code for PESTO☆66Updated 11 months ago
- Models and datasets for training deep learning automatic mixing models☆100Updated 9 months ago
- Unofficial implementation of SpecTNT in pytorch☆45Updated 2 years ago
- Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.…☆52Updated last year
- Accurate and general beat tracker☆137Updated 3 months ago
- The latent diffusion model for text-to-music generation.☆169Updated last year