iver56 / torch-audiomentationsLinks
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,069Updated 6 months ago
Alternatives and similar repositories for torch-audiomentations
Users that are interested in torch-audiomentations are comparing it to the libraries listed below
Sorting:
- Audio processing by using pytorch 1D convolution network☆1,079Updated 2 months ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,104Updated 2 weeks ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆512Updated 3 years ago
- A library for speech data augmentation in time-domain☆668Updated 3 years ago
- ☆497Updated last year
- Collection of audio-focused loss functions in PyTorch☆798Updated last year
- The PyTorch-based audio source separation toolkit for researchers☆2,429Updated 2 weeks ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆652Updated 3 years ago
- Efficient Training of Audio Transformers with Patchout☆343Updated last year
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆494Updated 4 years ago
- spafe: Simplified Python Audio Features Extraction☆476Updated 4 months ago
- An open source dataset for source separation☆440Updated last year
- ☆1,529Updated last year
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,326Updated 2 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆989Updated 2 years ago
- Tools for handling multimodal data in machine learning projects.☆1,048Updated this week
- ☆683Updated 10 months ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆781Updated 4 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆393Updated 2 years ago
- Official repository for RawNet, RawNet2, and RawNet3☆383Updated last year
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆538Updated last year
- Fast PyTorch based DSP for audio and 1D signals☆446Updated 5 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆827Updated 8 months ago
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,259Updated last year
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆325Updated last year
- OpenL3: Open-source deep audio and image embeddings☆533Updated 2 years ago
- End-to-End Neural Diarization☆404Updated 3 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,186Updated 4 years ago
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆720Updated 2 years ago
- Audio transformations library for PyTorch☆233Updated 3 years ago