Official PyTorch implementation of CoverHunter
☆32Nov 21, 2024Updated last year
Alternatives and similar repositories for CoverHunter
Users who are interested in CoverHunter are comparing it to the repositories listed below
- Fast constant-Q transform feature, C++ implementation☆11Jul 6, 2023Updated 2 years ago
- Implementation of "Bytecover: Cover song identification via multi-loss training" paper (ICASSP 2021)☆32Sep 10, 2025Updated 5 months ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Fork of Liu Feng's CoverHunter to run on a single computer, plus more features and documentation.☆16Feb 17, 2026Updated 2 weeks ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated 3 weeks ago
- ☆29Mar 19, 2025Updated 11 months ago
- Official Code of "A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task" (ISMIR 2023)☆18Nov 7, 2023Updated 2 years ago
- Learning a Representation for Cover Song Identification Using Convolutional Neural Network. ICASSP 2020☆54Jun 15, 2023Updated 2 years ago
- A fast Python library for aligning similar audio snippets passed in as NumPy arrays☆48Oct 27, 2025Updated 4 months ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- text to speech☆10Mar 19, 2024Updated last year
- Cover Song Detection System☆10Mar 29, 2019Updated 6 years ago
- Official implementation of Neural Audio Fingerprint (ICASSP 2021)☆203Aug 21, 2025Updated 6 months ago
- ☆13Dec 18, 2017Updated 8 years ago
- ☆14Aug 1, 2025Updated 7 months ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Unofficial implementation of NANSY++ in PyTorch Lightning☆50Mar 11, 2024Updated last year
- ☆13Mar 11, 2025Updated 11 months ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesis, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 3 years ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra"☆66Jun 3, 2023Updated 2 years ago
- Code for reproducing the paper Music Augmentation and Denoising For Peak-Based Audio Fingerprinting☆16Oct 31, 2023Updated 2 years ago
- A neural network for filtering a target speaker's voice from audio, written in TensorFlow☆21Jun 21, 2018Updated 7 years ago
- A singing voice database created by having 22 people each sing five children's songs.☆14Aug 7, 2022Updated 3 years ago
- ☆15Apr 2, 2025Updated 11 months ago
- Semi-Supervised Contrastive Learning for music classification - towards HIL-representation learning.☆17Jul 24, 2024Updated last year
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper.☆39Jul 8, 2024Updated last year
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆106Jan 10, 2025Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Code of our ISMIR 2025 paper - D. Afchar, G. Meseguer Brocal, K. Akesbi, R. Hennequin☆34Nov 12, 2025Updated 3 months ago
- Support material and source code for the model described in : "A Recurrent Encoder-Decoder Approach With Skip-Filtering Connections For M…☆13Sep 19, 2017Updated 8 years ago
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆43Sep 24, 2025Updated 5 months ago
- PyTorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆89Feb 2, 2026Updated last month
- The source code for the paper CrossSinger (ASRU 2023)☆18Oct 12, 2023Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆160Nov 12, 2022Updated 3 years ago
- ☆251Feb 14, 2024Updated 2 years ago
- Readability-aware automatic lyrics transcription (ALT) evaluation toolkit☆43Aug 29, 2024Updated last year