iver56/audiomentations

iver56 / audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

☆2,302

Alternatives and similar repositories for audiomentations

Users who are interested in audiomentations are comparing it to the libraries listed below.

iver56 / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,160 • Nov 24, 2025 • Updated 7 months ago
facebookresearch / WavAugment
A library for speech data augmentation in the time domain
☆689 • Aug 30, 2021 • Updated 4 years ago
KinWaiCheuk / nnAudio
Audio processing using PyTorch 1D convolution networks
☆1,129 • May 21, 2026 • Updated 2 months ago
qiuqiangkong / torchlibrosa
☆512 • Jun 25, 2024 • Updated 2 years ago
asteroid-team / asteroid
The PyTorch-based audio source separation toolkit for researchers
☆2,576 • May 13, 2026 • Updated 2 months ago
csteinmetz1 / auraloss
Collection of audio-focused loss functions in PyTorch
☆874 • Jul 30, 2024 • Updated last year
s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,557 • Mar 12, 2026 • Updated 4 months ago
aliutkus / speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,050 • Jul 5, 2023 • Updated 3 years ago
speechbrain / speechbrain
A PyTorch-based Speech Toolkit
☆11,699 • Jun 15, 2026 • Updated last month
DemisEom / SpecAugment
An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
☆655 • Apr 5, 2022 • Updated 4 years ago
microsoft / DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
☆1,446 • Jul 25, 2024 • Updated last year
qiuqiangkong / audioset_tagging_cnn
☆1,765 • Jul 25, 2024 • Updated last year
google-research / leaf-audio
LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…
☆528 • Mar 1, 2022 • Updated 4 years ago
Spijkervet / torchaudio-augmentations
Audio transformations library for PyTorch
☆239 • Apr 19, 2022 • Updated 4 years ago
pytorch / audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆2,915 • Updated this week
YuanGongND / ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
☆1,464 • May 21, 2023 • Updated 3 years ago
espnet / espnet
End-to-End Speech Processing Toolkit
☆9,898 • Updated this week
pranaymanocha / PerceptualAudio
Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM
☆382 • Mar 24, 2023 • Updated 3 years ago
maxrmorrison / torchcrepe
PyTorch implementation of the CREPE pitch tracker
☆523 • May 16, 2025 • Updated last year
csteinmetz1 / pyloudnorm
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
☆775 • Jan 4, 2026 • Updated 6 months ago
gabrielmittag / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆963 • Dec 1, 2024 • Updated last year
LCAV / pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…
☆1,909 • Updated this week
NVIDIA / BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
☆1,227 • Sep 5, 2024 • Updated last year
jim-schwoebel / voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
☆2,211 • Jun 6, 2024 • Updated 2 years ago
Graphi07 / room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
☆594 • May 11, 2026 • Updated 2 months ago
jik876 / hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆2,357 • Jul 27, 2024 • Updated last year
kan-bayashi / ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
☆1,646 • Apr 22, 2024 • Updated 2 years ago
WenzheLiu-Speech / awesome-speech-enhancement
Speech enhancement, speech separation, sound source localization
☆1,244 • Nov 14, 2023 • Updated 2 years ago
fgnt / nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
☆566 • Mar 19, 2025 • Updated last year
zcaceres / spec_augment
🔦 A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆501 • Jun 11, 2021 • Updated 5 years ago
nanahou / Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…
☆831 • Dec 1, 2020 • Updated 5 years ago
facebookresearch / denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech E…
☆1,904 • Mar 14, 2023 • Updated 3 years ago
descriptinc / descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
☆1,839 • Updated this week
schmiph2 / pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
☆456 • Feb 15, 2025 • Updated last year
Audio-WestlakeU / FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
☆606 • Aug 19, 2023 • Updated 2 years ago
facebookresearch / encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
☆3,998 • Jan 4, 2024 • Updated 2 years ago
lhotse-speech / lhotse
Tools for handling multimodal data in machine learning projects.
☆1,143 • Jun 22, 2026 • Updated 3 weeks ago
bootphon / phonemizer
Simple text to phones converter for multiple languages
☆1,558 • Sep 26, 2024 • Updated last year
gemelo-ai / vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
☆1,143 • Aug 7, 2024 • Updated last year