soerenab / AudioMNIST
β340Updated 6 months ago
Related projects: β
- UrbanSound classification using Convolutional Recurrent Networks in PyTorchβ380Updated 3 years ago
- π¦ A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognitionβ489Updated 3 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.β1,124Updated 3 years ago
- Problem Agnostic Speech Encoderβ439Updated last year
- A library for speech data augmentation in time-domainβ635Updated 3 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brainβ639Updated 2 years ago
- OpenL3: Open-source deep audio and image embeddingsβ452Updated last year
- Audio processing by using pytorch 1D convolution networkβ1,009Updated 7 months ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanksβ¦β494Updated 2 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.β925Updated 2 weeks ago
- An STFT/iSTFT for PyTorch.β342Updated 10 months ago
- Fast PyTorch based DSP for audio and 1D signalsβ421Updated last year
- Speech Enhancement Generative Adversarial Network in PyTorchβ376Updated last year
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAMβ350Updated last year
- A free audio dataset of spoken digits. An audio version of MNIST.β617Updated 4 months ago
- Audio transformations library for PyTorchβ220Updated 2 years ago
- Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.β218Updated last year
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasetsβ376Updated 5 years ago
- β221Updated 4 years ago
- Improved Wave-U-Net implemented in Pytorchβ302Updated last month
- A library for augmenting annotated audio dataβ231Updated 3 years ago
- β460Updated 2 months ago
- Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional reβ¦β333Updated last year
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learnβ142Updated last year
- A test bed for updates and new features | pytorch/audioβ169Updated 4 years ago
- A flexible source separation library in Pythonβ605Updated last year
- Spectrograms, MFCCs, and Inversion Demo in a jupyter notebookβ164Updated 5 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.β346Updated 2 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversionβ318Updated last year
- A library for soundscape synthesis and augmentationβ375Updated 2 years ago