lucacoma / DiffTransfer
Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023)
☆26 Updated 6 months ago
Related projects:
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆22 Updated 4 months ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023) ☆33 Updated 10 months ago
- Deep Performer: Score-to-audio music performance synthesis ☆41 Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆29 Updated 8 months ago
- Frechet Audio Distance evaluation in PyTorch ☆34 Updated last year
- "Music Style Transfer with Time-Varying Inversion of Diffusion Models" ☆31 Updated last month
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024] ☆35 Updated 2 weeks ago
- ☆37 Updated 3 months ago
- This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBER… ☆26 Updated 2 years ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024] ☆27 Updated last week
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986 ☆25 Updated 4 months ago
- Code for the ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste… ☆31 Updated last week
- ☆18 Updated 4 months ago
- Chorale Music Separation Dataset and Model Framework ☆31 Updated last year
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac… ☆22 Updated 3 months ago
- Unofficial implementation of NANSY++ in PyTorch Lightning ☆46 Updated 6 months ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator ☆19 Updated 9 months ago
- ☆26 Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/ ☆36 Updated last year
- ☆93 Updated 3 weeks ago
- Robust Singing Voice Transcription and MIDI Extraction ☆47 Updated last month
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment ☆62 Updated 2 months ago
- music semantic understanding evaluation benchmark ☆25 Updated last year
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023 ☆19 Updated 8 months ago
- Official source code of coco-mulla ☆28 Updated 5 months ago
- Stable Audio Unofficial Implementation: Latent Diffusion for Audio Generation ☆23 Updated 7 months ago
- ☆21 Updated 3 weeks ago
- The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model". ☆22 Updated last year
- Polyphonic generalisation of DDSP ☆15 Updated 4 months ago
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y… ☆17 Updated 4 months ago