Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
☆13Sep 18, 2020Updated 5 years ago
Alternatives and similar repositories for AT-GCN
Users that are interested in AT-GCN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available…☆20Nov 30, 2020Updated 5 years ago
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- ☆17Feb 14, 2020Updated 6 years ago
- Project for training SSL-based deepfake speech detector☆47Mar 18, 2026Updated last week
- Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/☆15Oct 15, 2018Updated 7 years ago
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated last month
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- An implementation of capsule routing for sound event detection☆15Jan 29, 2019Updated 7 years ago
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- ☆54Jun 3, 2020Updated 5 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Oct 13, 2023Updated 2 years ago
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 4 years ago
- ☆60Jul 2, 2024Updated last year
- ☆36Oct 15, 2024Updated last year
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 5 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆14Jun 9, 2021Updated 4 years ago
- tensorflow integration with mcdermottlab/pycochleagram☆19Jul 27, 2022Updated 3 years ago
- ☆12Nov 12, 2024Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- ☆20May 13, 2019Updated 6 years ago
- Baseline of DCASE 2020 task 4☆43Oct 24, 2022Updated 3 years ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 11 months ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- Keras/Pytorch neural network size, operations and parameters counter☆16Mar 23, 2023Updated 3 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 5 months ago
- Code and dataset for the paper "IsarStep: a Benchmark for High-level Mathematical Reasoning"☆12Mar 15, 2021Updated 5 years ago
- A list of papers about audio captioning☆79Jul 1, 2022Updated 3 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, w…☆16Nov 6, 2025Updated 4 months ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Nov 16, 2018Updated 7 years ago
- ☆14Sep 17, 2024Updated last year
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆50Dec 17, 2024Updated last year
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆169May 14, 2022Updated 3 years ago
- Code for the paper: "Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information"☆21Oct 10, 2021Updated 4 years ago