soerenab / AudioMNIST
☆348Updated 8 months ago
Related projects
Alternatives and complementary repositories for AudioMNIST
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆963Updated 2 weeks ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆978Updated last year
- 🔦 A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆490Updated 3 years ago
- Audio processing using a PyTorch 1D convolution network☆1,032Updated 9 months ago
- Problem Agnostic Speech Encoder☆439Updated last year
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,140Updated 3 years ago
- An STFT/iSTFT for PyTorch.☆353Updated last year
- An implementation of SpecAugment, introduced by Google Brain, with TensorFlow & PyTorch☆641Updated 2 years ago
- A library for speech data augmentation in the time domain☆647Updated 3 years ago
- A free audio dataset of spoken digits. An audio version of MNIST.☆626Updated 6 months ago
- Perceptual metrics of audio: perceptually relevant loss functions (DPAM and CDPAM)☆354Updated last year
- Speech Enhancement Generative Adversarial Network in PyTorch☆379Updated last year
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆501Updated 2 years ago
- Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional re…☆343Updated 2 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆383Updated 3 years ago
- Fast PyTorch-based DSP for audio and 1D signals☆427Updated 2 years ago
- Fetch and use Google's AudioSet dataset☆124Updated 7 years ago
- An open source dataset for source separation☆381Updated 9 months ago
- A library for soundscape synthesis and augmentation☆381Updated 2 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆402Updated 3 years ago
- Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.☆220Updated last year
- ☆151Updated 3 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆76Updated 6 years ago
- Python library for downloading, loading & working with sound datasets☆325Updated last month
- ☆223Updated 4 years ago
- A Python wrapper for the high-quality vocoder "World"☆725Updated last year
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆494Updated 2 months ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆908Updated last year
- ☆129Updated 2 months ago
- A minimal unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆315Updated 4 years ago