asteroid-team / torch-audiomentationsLinks

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

☆1,046

Alternatives and similar repositories for torch-audiomentations

Users that are interested in torch-audiomentations are comparing it to the libraries listed below

Sorting:

KinWaiCheuk / nnAudio
Audio processing by using pytorch 1D convolution network
☆1,069Updated 2 weeks ago
iver56 / audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
☆2,053Updated last week
qiuqiangkong / torchlibrosa
☆493Updated 11 months ago
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆661Updated 3 years ago
google-research / leaf-audio
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…
☆509Updated 3 years ago
csteinmetz1 / auraloss
Collection of audio-focused loss functions in PyTorch
☆783Updated 10 months ago
JorisCos / LibriMix
An open source dataset for source separation
☆423Updated last year
Spijkervet / torchaudio-augmentations
Audio transformations library for PyTorch
☆232Updated 3 years ago
adefossez / julius
Fast PyTorch based DSP for audio and 1D signals
☆442Updated 3 months ago
DemisEom / SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
☆649Updated 3 years ago
YuanGongND / ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
☆385Updated 2 years ago
kkoutini / PaSST
Efficient Training of Audio Transformers with Patchout
☆335Updated last year
zcaceres / spec_augment
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆494Updated 3 years ago
NVIDIA / CleanUNet
Official PyTorch Implementation of CleanUNet (ICASSP 2022)
☆322Updated last year
DmitryRyumin / INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …
☆673Updated 5 months ago
SuperKogito / spafe
spafe: Simplified Python Audio Features Extraction
☆474Updated 2 months ago
aliutkus / speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆969Updated last year
sp-uhh / sgmse
Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation
☆621Updated last week
fschmid56 / EfficientAT
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆285Updated 6 months ago
santi-pdp / pase
Problem Agnostic Speech Encoder
☆441Updated last year
hitachi-speech / EEND
End-to-End Neural Diarization
☆403Updated 3 years ago
etzinis / sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…
☆324Updated last year
f90 / Wave-U-Net-Pytorch
Improved Wave-U-Net implemented in Pytorch
☆343Updated 10 months ago
microsoft / MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…
☆528Updated 11 months ago
marl / openl3
OpenL3: Open-source deep audio and image embeddings
☆518Updated last year
pranaymanocha / PerceptualAudio
Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM
☆362Updated 2 years ago
microsoft / UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
☆463Updated last year
asteroid-team / asteroid
The PyTorch-based audio source separation toolkit for researchers
☆2,384Updated 4 months ago
facebookresearch / AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
☆590Updated last year
gabrielmittag / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆795Updated 6 months ago