adefossez / audio_mod_idessai
Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.
☆12Updated 5 months ago
Alternatives and similar repositories for audio_mod_idessai:
Users that are interested in audio_mod_idessai are comparing it to the libraries listed below
- Frechet Audio Distance evaluation in PyTorch☆36Updated last year
- A piano music dataset with Audio, Symbolic and Text labels☆25Updated 2 months ago
- ☆16Updated 5 months ago
- music semantic understanding evaluation benchmark☆25Updated last year
- Project for MIDI to Audio Synthesis☆22Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- PyTorch version of Spotify's Basic Pitch☆33Updated 10 months ago
- ☆39Updated 3 months ago
- A list of datasets made available by members of the Aalto Acoustics Lab☆19Updated 5 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆33Updated 2 months ago
- Supervised and unsupervised Concept-based explanation of pretrained music classifiers☆12Updated last year
- iSeparate library for the SDX2023 challenge☆13Updated last year
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆17Updated 2 months ago
- ☆43Updated last year
- Deep Performer: Score-to-audio music performance synthesis☆43Updated last year
- Unconditional music synthesis using a diffusion model in the STFT domain☆12Updated 2 years ago
- ☆9Updated 8 months ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆19Updated last year
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆17Updated last month
- ☆13Updated 2 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆16Updated 2 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆40Updated last year
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆29Updated last year
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆10Updated 8 months ago
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆23Updated 9 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆24Updated 6 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆24Updated 10 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆11Updated 7 months ago