SubramaniKrishna / point-cloud-audio
Accompanying code for our paper "Point Cloud Audio Processing"
☆19Updated 3 years ago
Related projects: ⓘ
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆31Updated 3 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 2 years ago
- Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations".☆20Updated 2 years ago
- Unsupervised Representation Learning for Singing Voice Separation☆21Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated last month
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆21Updated 2 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆29Updated 8 months ago
- ☆14Updated 2 years ago
- Project for MIDI to Audio Synthesis☆19Updated last year
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated 11 months ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆28Updated last year
- Addressing the confounds of accompaniments in singer identification☆18Updated 4 years ago
- ☆32Updated 3 years ago
- ☆18Updated 2 years ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Updated 2 years ago
- Unofficial PyTorch dataset for Slakh☆9Updated 3 years ago
- Frechet Audio Distance evaluation in PyTorch☆34Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆25Updated 4 months ago
- ☆14Updated 3 weeks ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆33Updated 6 months ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆49Updated 2 years ago
- ☆21Updated 2 years ago
- ☆9Updated 7 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- ☆78Updated last year
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆20Updated 3 years ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆39Updated last year
- Chorale Music Separation Dataset and Model Framework☆31Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆22Updated 4 months ago