Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding
☆55Sep 25, 2025Updated 5 months ago
Alternatives and similar repositories for SLAP
Users that are interested in SLAP are comparing it to the libraries listed below
Sorting:
- Code of our ISMIR 2025 paper - D. Afchar, G. Meseguer Brocal, K. Akesbi, R. Hennequin☆34Nov 12, 2025Updated 3 months ago
- Encode and decode audio samples to/from continuous and discrete compressed representations!☆104Nov 25, 2025Updated 3 months ago
- ☆29Mar 19, 2025Updated 11 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Oct 28, 2025Updated 4 months ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations☆17Mar 31, 2025Updated 11 months ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆16Feb 1, 2026Updated last month
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆135Feb 3, 2025Updated last year
- Code for paper: "Deep Embeddings and Section Fusion Improve Music Segmentation"☆53Oct 10, 2022Updated 3 years ago
- Github repository for the paper accepted in ICASSP 2024 : Blind estimation of audio effects using an auto-encoder approach and differenti…☆14Apr 11, 2024Updated last year
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- This is the codes repository for the paper "Emotion-Guided Music Accompaniment Generation based on VAE".☆13Oct 11, 2023Updated 2 years ago
- This repo is text to speech with learnable audio encoder without alignment with transcript reference☆53Sep 20, 2025Updated 5 months ago
- ☆15Apr 13, 2025Updated 10 months ago
- Training code for kokoro tts model☆34Nov 15, 2025Updated 3 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- ☆28Jul 7, 2025Updated 7 months ago
- ☆102Oct 16, 2025Updated 4 months ago
- Full models and training code for PESTO☆75Jun 12, 2024Updated last year
- ☆251Feb 14, 2024Updated 2 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 6 months ago
- large vocabulary automatic chord estimation with deep learning☆18Jun 2, 2021Updated 4 years ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆23Jan 27, 2025Updated last year
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆43Oct 30, 2025Updated 4 months ago
- Text-to-Speech Latency Benchmark☆22Jan 16, 2026Updated last month
- Unofficial implementation of MT3: Multi-Task Multitrack Music Transcription (Google Research, 2022) in pytorch☆23Aug 16, 2023Updated 2 years ago
- ☆28Jul 31, 2025Updated 7 months ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27May 20, 2025Updated 9 months ago
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".☆434May 25, 2025Updated 9 months ago
- ☆55Nov 5, 2024Updated last year
- Training, validation, and inference code for various SSL approaches and architectures.☆79Oct 22, 2025Updated 4 months ago
- ☆23Sep 27, 2023Updated 2 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆23Nov 25, 2023Updated 2 years ago
- Searching for Music Mixing Graphs: A Pruning Approach☆25Feb 13, 2025Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆48Jan 19, 2026Updated last month
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆80Nov 7, 2025Updated 3 months ago
- Fast CosyVoice3 inference with tensorRT and tensorRT-LLM☆54Feb 15, 2026Updated 2 weeks ago
- The On-the-fly MIDI Augmentation Library!☆32Mar 30, 2025Updated 11 months ago
- ATEPP is a dataset of expressive piano performances by virtuoso pianists. (ISMIR2022)☆53Aug 5, 2024Updated last year