Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
☆14Sep 18, 2020Updated 5 years ago
Alternatives and similar repositories for AT-GCN
Users that are interested in AT-GCN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available…☆20Nov 30, 2020Updated 5 years ago
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- ☆17Feb 14, 2020Updated 6 years ago
- Project for training SSL-based deepfake speech detector☆52Mar 30, 2026Updated 2 months ago
- Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/☆15Oct 15, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated 3 months ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- An implementation of capsule routing for sound event detection☆15Jan 29, 2019Updated 7 years ago
- Download and create a tfreader for the audioset dataset☆17Apr 16, 2020Updated 6 years ago
- ☆55Jun 3, 2020Updated 6 years ago
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 4 years ago
- ☆60Jul 2, 2024Updated last year
- ☆38Oct 15, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 6 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- ☆14Jun 9, 2021Updated 5 years ago
- tensorflow integration with mcdermottlab/pycochleagram☆19Jul 27, 2022Updated 3 years ago
- ☆12Nov 12, 2024Updated last year
- Task 4 Large-scale weakly supervised sound event detection for smart cars☆67Dec 20, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- Baseline of DCASE 2020 task 4☆43Oct 24, 2022Updated 3 years ago
- ☆20May 13, 2019Updated 7 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated last year
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- Keras/Pytorch neural network size, operations and parameters counter☆16Mar 23, 2023Updated 3 years ago
- A list of papers about audio captioning☆79Jul 1, 2022Updated 3 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, w…☆16Nov 6, 2025Updated 7 months ago
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆24Aug 3, 2023Updated 2 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆31May 22, 2026Updated 3 weeks ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆51Dec 17, 2024Updated last year
- ☆15Sep 17, 2024Updated last year
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆168May 14, 2022Updated 4 years ago
- The 2018 LifeCLEF bird identification task baseline system.☆52Dec 30, 2021Updated 4 years ago