stdio2016 / pfann
Neural Network Audio FingerPrint
☆59Updated 2 years ago
Alternatives and similar repositories for pfann:
Users that are interested in pfann are comparing it to the libraries listed below
- Official PyTorch implementation of CoverHunter☆29Updated 5 months ago
- LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK. ICASSP2020☆53Updated last year
- Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available…☆20Updated 4 years ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- metadata for SHS100K☆23Updated 7 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- Implementation of "Bytecover: Cover song identification via multi-loss training" paper (ICASSP 2021)☆30Updated last month
- Temporal Pyramid Pooling Convolutional Neural Network for Cover Song Identification☆33Updated 5 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- ☆29Updated 2 years ago
- Discriminative Condition-Aware PLDA☆43Updated 9 months ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆149Updated 2 years ago
- ☆32Updated 2 years ago
- CP-JKU submission to DCASE 20☆44Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Pytorch implementation of paper "High Fidelity Speech Regeneration With Application to Speech Enhancement"☆15Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Code for DCASE 2020 task 1a and task 1b.☆86Updated 3 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 5 years ago
- experiments about AudioSet☆44Updated last year
- ☆25Updated 6 months ago
- Adapting a ConvNeXt model to audio classification on AudioSet☆22Updated 2 months ago
- Pytorch: Channel-wise subband (CWS) input for better voice and accompaniment separation☆96Updated 3 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆30Updated 9 months ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Updated 5 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Updated 5 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 4 years ago