KinWaiCheuk / nnAudio
Audio processing by using pytorch 1D convolution network
☆1,032Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for nnAudio
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆963Updated last week
- Collection of audio-focused loss functions in PyTorch☆744Updated 3 months ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆501Updated 2 years ago
- ☆471Updated 4 months ago
- OpenL3: Open-source deep audio and image embeddings☆467Updated last year
- Fast PyTorch based DSP for audio and 1D signals☆427Updated 2 years ago
- A library for speech data augmentation in time-domain☆647Updated 3 years ago
- A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.☆1,875Updated last week
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆641Updated 2 years ago
- Implementation of the Wave-U-Net for audio source separation☆844Updated last year
- Improved Wave-U-Net implemented in Pytorch☆312Updated 3 months ago
- Audio transformations library for PyTorch☆226Updated 2 years ago
- spafe: Simplified Python Audio Features Extraction☆460Updated 5 months ago
- A library for soundscape synthesis and augmentation☆380Updated 2 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆978Updated last year
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM☆354Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆908Updated last year
- Official PyTorch implementation of Contrastive Learning of Musical Representations☆309Updated 3 months ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆490Updated 3 years ago
- Efficient Training of Audio Transformers with Patchout☆305Updated 10 months ago
- An open source dataset for source separation☆380Updated 9 months ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆364Updated 2 years ago
- A PyTorch implementation of DNN-based source separation.☆291Updated 2 years ago
- ☆652Updated last month
- A flexible source separation library in Python☆622Updated last year
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆499Updated 2 years ago
- Python library for downloading, loading & working with sound datasets☆324Updated last month
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,139Updated 3 years ago
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆308Updated last year
- Problem Agnostic Speech Encoder☆439Updated last year