KinWaiCheuk / nnAudioLinks
Audio processing by using pytorch 1D convolution network
☆1,083Updated 4 months ago
Alternatives and similar repositories for nnAudio
Users that are interested in nnAudio are comparing it to the libraries listed below
Sorting:
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,082Updated 8 months ago
- Collection of audio-focused loss functions in PyTorch☆809Updated last year
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆514Updated 3 years ago
- A library for speech data augmentation in time-domain☆671Updated 4 years ago
- ☆500Updated last year
- Fast PyTorch based DSP for audio and 1D signals☆447Updated 7 months ago
- A flexible source separation library in Python☆640Updated 9 months ago
- Implementation of the Wave-U-Net for audio source separation☆909Updated 2 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,132Updated last week
- OpenL3: Open-source deep audio and image embeddings☆539Updated 2 years ago
- A library for soundscape synthesis and augmentation☆407Updated 3 years ago
- ☆687Updated 11 months ago
- Improved Wave-U-Net implemented in Pytorch☆354Updated last year
- The PyTorch-based audio source separation toolkit for researchers☆2,453Updated last month
- spafe: Simplified Python Audio Features Extraction☆477Updated 5 months ago
- Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm☆719Updated last year
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,196Updated 4 years ago
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆726Updated 2 years ago
- Python library for Room Impulse Response (RIR) simulation with GPU acceleration☆553Updated last month
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆653Updated 3 years ago
- An open source dataset for source separation☆443Updated last year
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆332Updated 2 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,004Updated 2 years ago
- Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…☆1,667Updated 3 months ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations☆332Updated last year
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆786Updated 4 years ago
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,021Updated 2 years ago
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM☆371Updated 2 years ago
- ☆427Updated last year
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆549Updated last year