asteroid-team / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,017Updated 2 months ago
Alternatives and similar repositories for torch-audiomentations:
Users that are interested in torch-audiomentations are comparing it to the libraries listed below
- Audio processing by using pytorch 1D convolution network☆1,055Updated last year
- A library for speech data augmentation in time-domain☆656Updated 3 years ago
- A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.☆1,983Updated this week
- ☆485Updated 8 months ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆504Updated 3 years ago
- Collection of audio-focused loss functions in PyTorch☆768Updated 7 months ago
- An open source dataset for source separation☆410Updated last year
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆647Updated 2 years ago
- Efficient Training of Audio Transformers with Patchout☆326Updated last year
- Fast PyTorch based DSP for audio and 1D signals☆438Updated last month
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆492Updated 3 years ago
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆307Updated last year
- Audio transformations library for PyTorch☆230Updated 2 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆378Updated 2 years ago
- The PyTorch-based audio source separation toolkit for researchers☆2,348Updated 2 months ago
- spafe: Simplified Python Audio Features Extraction☆466Updated this week
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM☆360Updated 2 years ago
- A PyTorch implementation of DNN-based source separation.☆296Updated 2 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,246Updated last year
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆507Updated 3 years ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆748Updated 4 years ago
- End-to-End Neural Diarization☆397Updated 3 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆270Updated 4 months ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆952Updated last year
- ☆1,430Updated 7 months ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆393Updated 7 months ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆210Updated last year
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf☆369Updated 3 years ago
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation☆583Updated last month
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…☆313Updated last year