Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
☆13Sep 18, 2020Updated 5 years ago
Alternatives and similar repositories for AT-GCN
Users that are interested in AT-GCN are comparing it to the libraries listed below
Sorting:
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available…☆20Nov 30, 2020Updated 5 years ago
- ☆17Feb 14, 2020Updated 6 years ago
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated 2 weeks ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- ☆54Jun 3, 2020Updated 5 years ago
- ☆14Jun 9, 2021Updated 4 years ago
- An implementation of capsule routing for sound event detection☆15Jan 29, 2019Updated 7 years ago
- Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/☆15Oct 15, 2018Updated 7 years ago
- Project for training SSL-based deepfake speech detector☆46Feb 2, 2026Updated last month
- ☆60Jul 2, 2024Updated last year
- Keras/Pytorch neural network size, operations and parameters counter☆16Mar 23, 2023Updated 2 years ago
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 4 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 5 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- ☆20May 13, 2019Updated 6 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- Code for the paper: "Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information"☆21Oct 10, 2021Updated 4 years ago
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆24Aug 3, 2023Updated 2 years ago
- Task 4 Large-scale weakly supervised sound event detection for smart cars☆68Dec 20, 2021Updated 4 years ago
- Baseline of dcase 2019 task 4☆62Sep 2, 2022Updated 3 years ago
- ☆36Oct 15, 2024Updated last year
- A two step optimization for sound source separation on the adaptive front-end domain☆71Sep 18, 2020Updated 5 years ago
- A list of papers about audio captioning☆79Jul 1, 2022Updated 3 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Apr 7, 2022Updated 3 years ago
- Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.☆10Jun 7, 2022Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆12Nov 12, 2024Updated last year
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- ☆12Jun 22, 2020Updated 5 years ago
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"☆11Sep 20, 2021Updated 4 years ago
- Visual Relationship Understanding☆10Oct 2, 2021Updated 4 years ago
- DCASE 2017 Baseline system☆82Jun 26, 2020Updated 5 years ago
- Evaluation toolbox for Sound Event Detection☆158Jun 12, 2024Updated last year
- Audio captioning baseline system for DCASE 2020 challenge.☆38Aug 22, 2023Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Oct 13, 2023Updated 2 years ago