Pexeso / audio-fingerprinting-benchmark-toolkit
☆21Dec 19, 2023Updated 2 years ago
Alternatives and similar repositories for audio-fingerprinting-benchmark-toolkit
Users who are interested in audio-fingerprinting-benchmark-toolkit are comparing it to the libraries listed below
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆29Sep 11, 2025Updated 5 months ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Apr 8, 2024Updated last year
- Code to reproduce the experiments in the paper "Fast and stable blind source separation with rank-1 updates" presented at ICASSP 2020.☆21Apr 14, 2020Updated 5 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆22Dec 8, 2022Updated 3 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- ☆11Nov 7, 2024Updated last year
- Autoencoder Based Real-Time Timbre Interpolation Algorithm☆12Aug 17, 2020Updated 5 years ago
- Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment.☆13Mar 24, 2023Updated 2 years ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 6 months ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Apr 16, 2024Updated last year
- Implementation of "Bytecover: Cover song identification via multi-loss training" paper (ICASSP 2021)☆32Sep 10, 2025Updated 5 months ago
- A musical instrument audio dataset generated on-the-fly using FM synthesis.☆39Dec 30, 2025Updated last month
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- ☆13Mar 11, 2025Updated 11 months ago
- This repository contains the migrated code of Spleeter from Deezer in TF2.0☆28Jan 20, 2021Updated 5 years ago
- Training and evaluation code for Re-MOVE models with embedding distillation☆31Jul 6, 2023Updated 2 years ago
- ☆16Sep 12, 2019Updated 6 years ago
- ☆15Apr 2, 2025Updated 10 months ago
- Code for reproducing the paper Music Augmentation and Denoising For Peak-Based Audio Fingerprinting☆16Oct 31, 2023Updated 2 years ago
- Official PyTorch implementation of 'Rec-RIR: Monaural Blind Room Impulse Response Identification via DNN-based Reverberant Speech Reconst…☆29Dec 25, 2025Updated last month
- Dippy Synthetic Speech Subnet☆17Sep 11, 2025Updated 5 months ago
- Learnable STRF, from Riad et al. 2021 JASA☆13Aug 21, 2021Updated 4 years ago
- ☆24Mar 29, 2025Updated 10 months ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Oct 27, 2020Updated 5 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Feb 9, 2026Updated last week
- enhan(t) is an open source toolkit which enables you to enhance the web experience of existing video conferencing solutions like Zoom, MS…☆15Apr 28, 2022Updated 3 years ago
- ☆32Oct 23, 2025Updated 3 months ago
- ☆16Oct 16, 2018Updated 7 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated last week
- ☆16Sep 28, 2020Updated 5 years ago
- A Pytorch implementation of "Denoising Auto-encoder with Recurrent Skip Connections and Residual Regression for Music Source Separation"☆13Jul 3, 2019Updated 6 years ago
- A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors☆24Jul 30, 2025Updated 6 months ago
- Revisiting Singing Voice Detection : a Quantitative Review and the Future Outlook☆67Nov 21, 2022Updated 3 years ago
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Oct 27, 2025Updated 3 months ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- ☆15Jul 4, 2024Updated last year