KinWaiCheuk / nnAudioLinks
Audio processing by using pytorch 1D convolution network
☆1,083Updated 3 months ago
Alternatives and similar repositories for nnAudio
Users that are interested in nnAudio are comparing it to the libraries listed below
Sorting:
- Collection of audio-focused loss functions in PyTorch☆809Updated last year
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,083Updated 7 months ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆513Updated 3 years ago
- ☆500Updated last year
- A library for speech data augmentation in time-domain☆671Updated 4 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,132Updated this week
- Fast PyTorch based DSP for audio and 1D signals☆447Updated 6 months ago
- Implementation of the Wave-U-Net for audio source separation☆909Updated 2 years ago
- spafe: Simplified Python Audio Features Extraction☆477Updated 5 months ago
- OpenL3: Open-source deep audio and image embeddings☆539Updated 2 years ago
- ☆687Updated 11 months ago
- A flexible source separation library in Python☆640Updated 9 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,453Updated last month
- A library for soundscape synthesis and augmentation☆406Updated 3 years ago
- An open source dataset for source separation☆443Updated last year
- Improved Wave-U-Net implemented in Pytorch☆354Updated last year
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,001Updated 2 years ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆783Updated 4 years ago
- Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…☆1,667Updated 3 months ago
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆547Updated last year
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆726Updated 2 years ago
- Deep learning for audio denoising☆732Updated last year
- Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm☆719Updated last year
- A PyTorch implementation of DNN-based source separation.☆305Updated 3 years ago
- Audio transformations library for PyTorch☆234Updated 3 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,020Updated 2 years ago
- ☆426Updated last year
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆653Updated 3 years ago
- Evaluation functions for music/audio information retrieval/signal processing algorithms.☆665Updated last month
- Python library for Room Impulse Response (RIR) simulation with GPU acceleration☆553Updated last month