lucacoma / DiffTransfer
Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023)
☆27 Updated last year
Alternatives and similar repositories for DiffTransfer:
Users who are interested in DiffTransfer are comparing it to the repositories listed below:
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆25 Updated 10 months ago
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024] ☆40 Updated last month
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆36 Updated last year
- Project for MIDI to Audio Synthesis ☆21 Updated last year
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023 ☆21 Updated last year
- Unofficial implementation of NANSY++ in PyTorch Lightning ☆50 Updated 11 months ago
- ☆43 Updated 8 months ago
- ☆10 Updated last year
- Frechet Audio Distance evaluation in PyTorch ☆35 Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆46 Updated 5 months ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator ☆21 Updated last year
- Deep Performer: Score-to-audio music performance synthesis ☆43 Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986 ☆42 Updated 4 months ago
- Music semantic understanding evaluation benchmark ☆25 Updated last year
- ☆19 Updated 5 months ago
- Source Separation training codebase for the Sound Demixing Challenge 2023. ☆40 Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024) ☆36 Updated 8 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT). ☆57 Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/ ☆36 Updated last year
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023) ☆35 Updated last year
- Code for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste… ☆36 Updated 5 months ago
- Landing Page for All Things Source Separation ☆22 Updated 3 months ago
- Official source code of coco-mulla ☆32 Updated 11 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 Updated last year
- Chorale Music Separation Dataset and Model Framework ☆35 Updated 2 years ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment ☆66 Updated 7 months ago
- ICASSP 2022 ☆61 Updated 3 years ago
- ☆87 Updated 2 years ago