COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations
☆48Jul 25, 2024Updated last year
Alternatives and similar repositories for coala
Users that are interested in coala are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- ☆14Nov 22, 2022Updated 3 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Feb 9, 2021Updated 5 years ago
- ☆58Nov 2, 2020Updated 5 years ago
- Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.☆120Dec 13, 2021Updated 4 years ago
- NASH 2021 project... this may or may not end up working 🤷♂️☆12Dec 19, 2021Updated 4 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Jun 22, 2023Updated 2 years ago
- Convolutional Neural Network for multitrack mix leveling☆18Jun 25, 2018Updated 7 years ago
- A convolutional generative audio synthesis model☆32Jun 17, 2022Updated 3 years ago
- Deep Performer: Score-to-audio music performance synthesis☆44Jun 26, 2023Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- Dissimilarity Matrix and Sounds from Timbre Space Representation of a Subtractive Synthesizer (Timbre, 2020)☆12Dec 17, 2021Updated 4 years ago
- Addressing the confounds of accompaniments in singer identification☆18Mar 24, 2020Updated 6 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- Efficient neural networks for analog audio effect modeling☆169Jun 9, 2022Updated 3 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated this week
- A speech signal processing library in Python with emphasis on deep learning.☆31Jul 16, 2022Updated 3 years ago
- A PyTorch implementation of the musicnn model for music audio tagging☆38Jul 25, 2024Updated last year
- VIsually-Pivoted Audio and(N) Text☆22May 16, 2022Updated 3 years ago
- companion repository to the DAFx-19 paper "Assisted Sound Sample Generation with Musical Conditioning in Adversarial Auto-Encoders" by Ad…☆11Jun 22, 2019Updated 6 years ago
- A Pytorch implementation of Onsets and Frames (Hawthorne 2018)☆13Nov 10, 2020Updated 5 years ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆38Dec 16, 2024Updated last year
- ☆437Nov 1, 2023Updated 2 years ago
- A platform for the collaborative creation of open audio collections labeled by humans and based on Freesound content.☆144Oct 6, 2023Updated 2 years ago
- ☆47Nov 13, 2021Updated 4 years ago
- Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…☆64Apr 2, 2020Updated 5 years ago
- Realtime (streaming) DDSP in PyTorch compatible with neutone☆50Feb 4, 2025Updated last year
- ☆32Jul 27, 2022Updated 3 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Aug 20, 2024Updated last year
- Lottery ticket hypothesis for deep generative models☆11Jul 31, 2020Updated 5 years ago
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆21Sep 7, 2025Updated 6 months ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Dec 8, 2022Updated 3 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Aug 15, 2022Updated 3 years ago
- LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK. ICASSP2020☆54Jun 15, 2023Updated 2 years ago
- A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.☆368Feb 16, 2026Updated last month
- 我们安全了-暂时的☆11May 11, 2018Updated 7 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆31Apr 29, 2022Updated 3 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Feb 1, 2019Updated 7 years ago
- music semantic understanding evaluation benchmark☆25Aug 12, 2023Updated 2 years ago