Liu-Feng-deeplearning / CoverHunter
Official PyTorch implementation of CoverHunter
☆24Updated this week
Related projects ⓘ
Alternatives and complementary repositories for CoverHunter
- metadata for SHS100K☆21Updated 6 years ago
- Implementation of "Bytecover: Cover song identification via multi-loss training" paper (ICASSP 2021)☆25Updated 3 weeks ago
- Neural Network Audio FingerPrint☆56Updated last year
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆22Updated last month
- LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK. ICASSP2020☆52Updated last year
- experiments about AudioSet☆43Updated last year
- Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification☆33Updated 4 years ago
- PAM is a no-reference audio quality metric for audio generation tasks☆49Updated 4 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆22Updated 7 months ago
- Cover Song Detection System☆10Updated 5 years ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆81Updated 8 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆63Updated 2 months ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆32Updated last week
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆27Updated 2 years ago
- real-time speech enhance☆12Updated 10 months ago
- ☆13Updated last year
- An end-to-end chorus detection model DeepChorus.☆30Updated 2 years ago
- acoss: Audio Cover Song Suite is a framework for feature extraction and benchmarking for the cover song identification (CSI) task☆37Updated last year
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆67Updated 2 weeks ago
- Code for CVSSP submission to DCASE 2021 Task 6☆35Updated 2 years ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated 9 months ago
- Spherical residual vector quantization (SRVQ)☆26Updated 3 months ago
- ☆21Updated 7 months ago
- ☆49Updated last year
- The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…☆11Updated last year
- music semantic understanding evaluation benchmark☆25Updated last year
- Query-conditioned target sound extraction model☆18Updated 3 weeks ago
- Prediction of sound event bounding boxes (SEBBs)☆22Updated 3 months ago
- Robust Singing Voice Transcription and MIDI Extraction☆58Updated this week
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago