stdio2016 / pfann
Neural Network Audio FingerPrint
☆58Updated 2 years ago
Alternatives and similar repositories for pfann:
Users that are interested in pfann are comparing it to the libraries listed below
- Official PyTorch implementation of CoverHunter☆28Updated 3 months ago
- LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK. ICASSP2020☆53Updated last year
- metadata for SHS100K☆22Updated 7 years ago
- Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification☆33Updated 5 years ago
- Implementation of "Bytecover: Cover song identification via multi-loss training" paper (ICASSP 2021)☆28Updated last month
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆84Updated 11 months ago
- CP-JKU submission to DCASE 19, performant single-model CNN☆56Updated 4 years ago
- acoss: Audio Cover Song Suite is a framework for feature extraction and benchmarking for the cover song identification (CSI) task☆39Updated last year
- PyTorch code for training and evaluating MOVE, musically-motivated version embeddings☆49Updated last year
- Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available…☆20Updated 4 years ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Updated 2 years ago
- experiments about AudioSet☆44Updated last year
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Updated last year
- Official implementation of Neural Audio Fingerprint (ICASSP 2021)☆189Updated 7 months ago
- ☆58Updated 4 years ago
- CP-JKU submission to DCASE 20☆43Updated 3 years ago
- ☆29Updated 2 years ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆95Updated 3 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- This code is to run the WARP-Q speech quality metric.☆35Updated 4 months ago
- A Dataset for Cover Song Identification and Understanding☆59Updated 2 years ago
- A list of papers about audio captioning☆77Updated 2 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- MultiSV: scripts for data preparation☆27Updated last month
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆29Updated 4 years ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆56Updated 2 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago