hiromu / contrastive-singing-voicesLinks
Implementation of "Self-Supervised Contrastive Learning for Singing Voices"
☆19Updated 3 years ago
Alternatives and similar repositories for contrastive-singing-voices
Users that are interested in contrastive-singing-voices are comparing it to the libraries listed below
Sorting:
- Frechet Audio Distance evaluation in PyTorch☆34Updated last year
- A repo that builds text to music datasets from scratch☆21Updated 2 weeks ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆23Updated last year
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆13Updated 7 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆28Updated last year
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆60Updated 2 years ago
- Addressing the confounds of accompaniments in singer identification☆18Updated 5 years ago
- ☆10Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- A piano music dataset with Audio, Symbolic and Text labels☆27Updated 3 months ago
- Project for MIDI to Audio Synthesis☆23Updated 2 years ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆33Updated last year
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆22Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago
- ☆18Updated 5 years ago
- ☆26Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆33Updated 3 weeks ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- [ismir2019] Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice☆28Updated 2 years ago
- Repository for ISMIR 2022 tutorial T3(M): Designing Controllable Synthesis System for Musical Signals☆28Updated 2 years ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆32Updated 2 years ago
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆20Updated 2 years ago
- Code and demo for paper: Zhao et al., "Q&A: Query-Based Representation Learning for Multi-Track Symbolic Music re-Arrangement," IJCAI 202…☆18Updated last year
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆70Updated 2 months ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆42Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆25Updated last year
- ☆18Updated 3 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 3 years ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆22Updated 3 months ago