csteinmetz1 / st-ito
Audio production style transfer with inference-time optimization
☆16Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for st-ito
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆29Updated last year
- code for "DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper"☆20Updated 7 months ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆11Updated 2 weeks ago
- Using Word embeddings for automatic EQ mixing☆11Updated 2 years ago
- Polyphonic generalisation of DDSP☆16Updated 6 months ago
- ☆21Updated 2 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆37Updated last year
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆20Updated 11 months ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆22Updated 6 months ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆34Updated last month
- Project for MIDI to Audio Synthesis☆22Updated last year
- A piano music dataset with Audio, Symbolic and Text labels☆14Updated this week
- Differentiable dynamic range controller in PyTorch.☆44Updated 2 months ago
- ☆21Updated last month
- ☆42Updated last week
- ISMIR 24 Supplementary Material☆12Updated 2 weeks ago
- ☆11Updated 4 years ago
- ☆21Updated 6 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆37Updated 3 weeks ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 2 years ago
- Repository for ISMIR 2022 tutorial T3(M): Designing Controllable Synthesis System for Musical Signals☆28Updated last year
- Implementation of FiNS model for RIR estimation☆25Updated last year
- Code accompayning ISMIR23 paper; TriAD: Capturing harmonics with 3D convolutions☆13Updated 3 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆30Updated 10 months ago
- Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems☆35Updated 2 weeks ago
- Github repository for the paper accepted in ICASSP 2024 : Blind estimation of audio effects using an auto-encoder approach and differenti…☆12Updated 7 months ago
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆19Updated 6 months ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆16Updated last year
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆17Updated 5 months ago