stdio2016 / pfann
Neural Network Audio FingerPrint
☆56 · Updated last year
Related projects
Alternatives and complementary repositories for pfann
- LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK. ICASSP2020 ☆52 · Updated last year
- Official PyTorch implementation of CoverHunter ☆24 · Updated 7 months ago
- metadata for SHS100K ☆21 · Updated 6 years ago
- Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification ☆33 · Updated 4 years ago
- Implementation of "Bytecover: Cover song identification via multi-loss training" paper (ICASSP 2021) ☆25 · Updated 3 weeks ago
- acoss: Audio Cover Song Suite is a framework for feature extraction and benchmarking for the cover song identification (CSI) task ☆37 · Updated last year
- experiments about AudioSet ☆43 · Updated last year
- A Dataset for Cover Song Identification and Understanding ☆57 · Updated last year
- Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available… ☆20 · Updated 3 years ago
- CP-JKU submission to DCASE 20 ☆43 · Updated 3 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain. ☆89 · Updated last year
- PyTorch code for training and evaluating MOVE, musically-motivated version embeddings ☆49 · Updated last year
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation ☆94 · Updated 3 years ago
- Pytorch implementation of subband decomposition ☆89 · Updated 2 years ago
- The codebase for Data-driven general-purpose voice activity detection. ☆93 · Updated last year
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection" ☆23 · Updated 4 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals ☆50 · Updated 5 years ago
- Unsupervised Representation Learning for Singing Voice Separation ☆21 · Updated last year
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University. ☆71 · Updated 5 years ago
- Yin pitch estimator in PyTorch ☆115 · Updated 2 years ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation ☆81 · Updated 8 months ago
- A list of papers about audio captioning ☆78 · Updated 2 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning ☆48 · Updated 5 years ago
- pytorch implementation of JDCNet, singing voice detection and classification network ☆49 · Updated last year
- Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation ☆78 · Updated last year
- ☆179 · Updated 3 months ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK ☆61 · Updated 3 years ago
- Production first, nn-based on-device signal processing toolkit. ☆64 · Updated last year
- PAM is a no-reference audio quality metric for audio generation tasks ☆49 · Updated 4 months ago