EIHW / Attention-based_Atrous_CNN
Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes', by Zhao Ren, Qiuqiang Kong, Jing Han, Mark Plumbley, Björn Schuller.
☆14Updated 4 years ago
Alternatives and similar repositories for Attention-based_Atrous_CNN:
Users that are interested in Attention-based_Atrous_CNN are comparing it to the libraries listed below
- Surrey CVSSP DCASE 2018 Task 2 system☆19Updated 2 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- ☆20Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Language modelling for sound event detection☆21Updated 5 years ago
- An implementation of capsule routing for sound event detection☆15Updated 6 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- ☆53Updated 4 years ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆12Updated 3 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- ☆16Updated 4 years ago
- ☆36Updated 2 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Updated 6 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆39Updated 3 years ago
- A PyTorch 1.0 implementation of the convolutions described in SincNet☆32Updated 6 years ago
- ☆17Updated 3 years ago
- Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"☆16Updated 4 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Updated 4 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated last year
- PyTorch implementation of a self-attentive speaker embedding☆17Updated 5 years ago
- ☆17Updated 5 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆39Updated 2 years ago
- An evaluation toolkit for voice conversion models.☆40Updated 3 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆36Updated 2 years ago