iver56 / audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
☆1,968Updated this week
Alternatives and similar repositories for audiomentations:
Users that are interested in audiomentations are comparing it to the libraries listed below
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,002Updated last month
- Audio processing by using pytorch 1D convolution network☆1,053Updated last year
- ☆1,416Updated 7 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,333Updated last month
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆504Updated 3 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,222Updated last year
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆647Updated 2 years ago
- ☆485Updated 8 months ago
- A library for speech data augmentation in time-domain☆655Updated 3 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,594Updated 10 months ago
- Tools for handling speech data in machine learning projects.☆985Updated last week
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆1,847Updated 8 months ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆493Updated 3 years ago
- In defence of metric learning for speaker recognition☆1,086Updated 11 months ago
- Collection of audio-focused loss functions in PyTorch☆765Updated 7 months ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆787Updated last month
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,160Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆742Updated 3 months ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,340Updated 3 weeks ago
- List of speech synthesis papers.☆1,023Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,615Updated this week
- Deep learning for audio denoising☆687Updated last year
- A must-read paper for speech separation based on neural networks☆770Updated 2 years ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,036Updated 4 months ago
- ☆1,022Updated this week
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆656Updated 2 months ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆950Updated last year
- Large, modern dataset for speech recognition☆666Updated last year
- Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…☆1,527Updated 2 months ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆480Updated 3 years ago