KinWaiCheuk / nnAudioLinks
Audio processing by using pytorch 1D convolution network
☆1,078Updated 2 months ago
Alternatives and similar repositories for nnAudio
Users that are interested in nnAudio are comparing it to the libraries listed below
Sorting:
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,065Updated 6 months ago
- Collection of audio-focused loss functions in PyTorch☆798Updated last year
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆512Updated 3 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,100Updated last week
- A library for speech data augmentation in time-domain☆668Updated 3 years ago
- ☆497Updated last year
- Fast PyTorch based DSP for audio and 1D signals☆446Updated 5 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,425Updated last week
- Implementation of the Wave-U-Net for audio source separation☆902Updated 2 years ago
- spafe: Simplified Python Audio Features Extraction☆476Updated 4 months ago
- A flexible source separation library in Python☆635Updated 7 months ago
- A library for soundscape synthesis and augmentation☆405Updated 3 years ago
- OpenL3: Open-source deep audio and image embeddings☆532Updated 2 years ago
- Improved Wave-U-Net implemented in Pytorch☆350Updated last year
- ☆682Updated 9 months ago
- Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm☆713Updated last year
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,015Updated last year
- An open source dataset for source separation☆439Updated last year
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆538Updated last year
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,611Updated last year
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆329Updated 2 years ago
- Python library for Room Impulse Response (RIR) simulation with GPU acceleration☆542Updated 2 weeks ago
- Python library for downloading, loading & working with sound datasets☆343Updated this week
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆780Updated 4 years ago
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM☆367Updated 2 years ago
- A PyTorch implementation of DNN-based source separation.☆303Updated 3 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆513Updated 3 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆652Updated 3 years ago
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆719Updated 2 years ago
- ☆424Updated last year