drscotthawley / fad_pytorch
Frechet Audio Distance evaluation in PyTorch
☆35Updated last year
Alternatives and similar repositories for fad_pytorch:
Users that are interested in fad_pytorch are comparing it to the libraries listed below
- music semantic understanding evaluation benchmark☆25Updated last year
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆41Updated last year
- Project for MIDI to Audio Synthesis☆23Updated 2 years ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆21Updated last year
- A piano music dataset with Audio, Symbolic and Text labels☆27Updated last month
- Polyphonic generalisation of DDSP☆18Updated 11 months ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 7 months ago
- Differentiable dynamic range controller in PyTorch.☆47Updated 4 months ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆22Updated last year
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆64Updated 3 weeks ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆26Updated 11 months ago
- Repository for ISMIR 2022 tutorial T3(M): Designing Controllable Synthesis System for Musical Signals☆28Updated 2 years ago
- ☆17Updated 3 years ago
- Code accompayning ISMIR23 paper; TriAD: Capturing harmonics with 3D convolutions☆16Updated 8 months ago
- list of MIR dataset papers presented at ISMIR 2022☆61Updated 2 years ago
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper.☆36Updated 9 months ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆20Updated 2 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆59Updated 2 years ago
- PyTorch version of Spotify's Basic Pitch☆34Updated 11 months ago
- Code and demo for paper: Zhao et al., Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling, in NeurIPS 2024.☆28Updated 3 months ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆13Updated 5 months ago
- ☆53Updated 5 months ago
- ☆28Updated last year
- MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage☆40Updated 9 months ago
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆21Updated 11 months ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆34Updated 4 months ago
- ☆23Updated 11 months ago
- Audio production style transfer with inference-time optimization☆36Updated 4 months ago
- ☆44Updated last year
- Using Word embeddings for automatic EQ mixing☆13Updated 3 years ago