wangyu / rethink-audio-fslView external linksLinks
Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)
☆43May 24, 2022Updated 3 years ago
Alternatives and similar repositories for rethink-audio-fsl
Users that are interested in rethink-audio-fsl are comparing it to the libraries listed below
Sorting:
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Sep 22, 2024Updated last year
- Representation Learning for the Automatic Indexing of Sound Effects Libraries (ISMIR 2022): Deep audio embeddings pre-trained on UCS & No…☆47Jun 21, 2023Updated 2 years ago
- ☆20Aug 26, 2022Updated 3 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- Pytorch implementation of [Learning to match transient sound events using attentional similarity for few-shot sound recognition]☆33Feb 27, 2019Updated 6 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- ☆23Aug 30, 2022Updated 3 years ago
- ☆20Mar 12, 2025Updated 11 months ago
- A voice spoofing detection system, based on paper presented at ICSPIS 2021☆10Feb 11, 2022Updated 4 years ago
- ☆10Oct 9, 2025Updated 4 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical In…☆41Aug 12, 2022Updated 3 years ago
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Example python scripts to evaluate various ASR methods☆11Dec 22, 2021Updated 4 years ago
- ☆13Nov 22, 2022Updated 3 years ago
- On-going VA modeling research. Modeling dynamic range compressor using S4.☆19Nov 29, 2025Updated 2 months ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- PyTorch Dataset for Speech and Music audio☆80Jul 12, 2024Updated last year
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31May 31, 2023Updated 2 years ago
- Website for the ISMIR 2023 Tutorial: Few-shot and Zero-shot Learning for MIR☆30Jan 3, 2023Updated 3 years ago
- Pytorch implementation of MDensenet and sparse NMF. Made for my undergraduate thesis "Music Source Separation with Supervised Learning Me…☆11Jan 31, 2021Updated 5 years ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 6 months ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- ☆11Feb 17, 2017Updated 9 years ago
- ☆49Jul 29, 2021Updated 4 years ago
- Frechet Audio Distance evaluation in PyTorch☆36Jun 9, 2023Updated 2 years ago
- a new family of super small music generation models focusing on experimental music and latent space exploration capabilities☆36May 9, 2024Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆37Feb 24, 2025Updated 11 months ago
- CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning [Official PyTorch implementation]☆22Jun 12, 2025Updated 8 months ago
- Repository for DNN training, theory to practice, part of the Large Scale Machine Learning class at Mines Paritech☆12Mar 11, 2022Updated 3 years ago
- Inference code for PaSST, using the HEAR API.☆33Jan 2, 2024Updated 2 years ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆33Jan 28, 2026Updated 3 weeks ago
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…☆16Jul 31, 2025Updated 6 months ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations☆335Jul 25, 2024Updated last year
- Python library for downloading, loading & working with sound datasets☆350Sep 23, 2025Updated 4 months ago
- Creation of a multi user audio first annotation tool - GSoC 2021☆29Mar 30, 2023Updated 2 years ago
- ISMIR 2020 Tutorial for Metric Learning in MIR☆127Oct 25, 2020Updated 5 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 8 months ago
- ☆12May 30, 2023Updated 2 years ago