EIHW / Attention-based_Atrous_CNN
Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes', by Zhao Ren, Qiuqiang Kong, Jing Han, Mark Plumbley, Björn Schuller.
☆14Updated 4 years ago
Alternatives and similar repositories for Attention-based_Atrous_CNN:
Users that are interested in Attention-based_Atrous_CNN are comparing it to the libraries listed below
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- ☆20Updated 6 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆19Updated 2 years ago
- ☆36Updated 2 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- ☆53Updated 4 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 3 years ago
- An implementation of capsule routing for sound event detection☆15Updated 6 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆13Updated 3 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- ☆15Updated 3 years ago
- ☆16Updated 5 years ago
- ☆17Updated 6 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Updated 4 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated 2 years ago
- Language modelling for sound event detection☆21Updated 5 years ago
- Dialect identification using Siamese network☆15Updated 7 years ago
- The python implementation for paper "Towards Discriminative Representation Learning for Speech Emotion Recognition" in IJCAI-2019☆23Updated 5 years ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 5 months ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Updated 6 years ago
- ☆14Updated 2 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- ☆17Updated 3 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated last year
- Develop speaker recognition model based on i-vector using TIMIT database☆16Updated 5 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- A list of resources that can help in research for automated audio captioning☆34Updated 4 years ago
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Updated 4 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago