ksanjeevan / crnn-audio-classification
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
☆385Updated 3 years ago
Alternatives and similar repositories for crnn-audio-classification:
Users that are interested in crnn-audio-classification are comparing it to the libraries listed below
- ☆223Updated 5 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆126Updated 3 years ago
- Environmental sound classification using Deep Learning with extracted features☆165Updated 5 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆646Updated 2 years ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆493Updated 3 years ago
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w…☆187Updated 2 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆999Updated last month
- Pytorch port of Google Research's VGGish model used for extracting audio features.☆380Updated 3 years ago
- Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional re…☆348Updated 2 years ago
- Problem Agnostic Speech Encoder☆440Updated last year
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆504Updated 2 years ago
- Audio transformations library for PyTorch☆230Updated 2 years ago
- ☆483Updated 7 months ago
- This code aims at weakly-labeled semi-supervised sound event detection. The code embraces two methods we proposed to solve this task: sp…☆127Updated 4 years ago
- spafe: Simplified Python Audio Features Extraction☆464Updated 8 months ago
- Speech Enhancement Generative Adversarial Network in PyTorch☆386Updated last year
- Utterance-level Aggregation For Speaker Recognition In The Wild☆367Updated last year
- An implementation of vggish in keras with tf backend☆117Updated 3 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆310Updated 4 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆143Updated 2 years ago
- A library for speech data augmentation in time-domain☆655Updated 3 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆207Updated 2 years ago
- Audio processing by using pytorch 1D convolution network☆1,052Updated last year
- Official repository for RawNet, RawNet2, and RawNet3☆369Updated 11 months ago
- An STFT/iSTFT for PyTorch.☆355Updated last year
- Include some core functions and model to handle speech separation☆155Updated 3 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆165Updated 2 years ago
- ☆104Updated 4 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆73Updated 2 years ago
- DCASE 2018 Baseline systems☆129Updated 5 years ago