stdio2016 / pfann
Neural Network Audio FingerPrint
☆59Updated 2 years ago
Alternatives and similar repositories for pfann:
Users that are interested in pfann are comparing it to the libraries listed below
- LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK. ICASSP2020☆53Updated last year
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆95Updated 3 years ago
- metadata for SHS100K☆23Updated 7 years ago
- Official PyTorch implementation of CoverHunter☆29Updated 4 months ago
- CP-JKU submission to DCASE 20☆44Updated 3 years ago
- experiments about AudioSet☆44Updated last year
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆86Updated last year
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- Implementation of "Bytecover: Cover song identification via multi-loss training" paper (ICASSP 2021)☆30Updated last month
- Pytorch implementation of paper "High Fidelity Speech Regeneration With Application to Speech Enhancement"☆15Updated 3 years ago
- A unofficial Pytorch implementation of Google's VoiceFilter☆100Updated last year
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆90Updated last year
- Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification☆33Updated 5 years ago
- DCASE2020 Challenge Task 1 baseline system☆25Updated 4 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Updated last year
- Source code for Consistent ensemble distillation for audio tagging☆30Updated 8 months ago
- Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available…☆20Updated 4 years ago
- ☆32Updated 2 years ago
- ☆47Updated 4 months ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- A list of papers about audio captioning☆77Updated 2 years ago
- ☆78Updated 2 years ago
- acoss: Audio Cover Song Suite is a framework for feature extraction and benchmarking for the cover song identification (CSI) task☆39Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- ☆65Updated last year
- This code is to run the WARP-Q speech quality metric.☆35Updated 5 months ago