ynop/audiomate

Python library for handling audio datasets.

☆139

Alternatives and similar repositories for audiomate

Users that are interested in audiomate are comparing it to the libraries listed below.

SuperKogito / pydiogment
Python library for audio augmentation
☆84 · Jul 6, 2023 · Updated 3 years ago
ynop / deepspeech-german
Scripts for training Mozilla's DeepSpeech using German speech data
☆41 · Jan 22, 2020 · Updated 6 years ago
mdx-tutorial / mdx-tutorial.github.io
Tutorial covering Open Source tools for Source Separation.
☆15 · Nov 12, 2021 · Updated 4 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32 · Apr 8, 2022 · Updated 4 years ago
AppleHolic / pytorch_sound
Sound Related Deep Learning Tasks boosting repository with pytorch
☆88 · Jul 25, 2024 · Updated last year
daanzu / wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23 · Aug 16, 2021 · Updated 4 years ago
fgnt / paderbox
Paderbox: A collection of utilities for audio / speech processing
☆43 · Jul 21, 2025 · Updated last year
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104 · Nov 26, 2022 · Updated 3 years ago
alumae / streaming-punctuator
☆17 · Apr 14, 2023 · Updated 3 years ago
KinWaiCheuk / nnAudio
Audio processing by using pytorch 1D convolution network
☆1,129 · May 21, 2026 · Updated 2 months ago
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆689 · Aug 30, 2021 · Updated 4 years ago
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,397 · Jun 6, 2024 · Updated 2 years ago
soundata / soundata
Python library for downloading, loading & working with sound datasets
☆358 · Jul 14, 2026 · Updated last week
EvelynZhou / FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…
☆12 · Nov 30, 2021 · Updated 4 years ago
lucieperrotta / computersandmusic
Notebooks for the EPFL class "Computers and Music".
☆25 · Aug 20, 2021 · Updated 4 years ago
hirofumi0810 / neural_sp
End-to-end ASR/LM implementation with PyTorch
☆594 · Aug 30, 2021 · Updated 4 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16 · Jun 17, 2022 · Updated 4 years ago
etzinis / biased_separation
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14 · Nov 16, 2020 · Updated 5 years ago
eldrin / MTLMusicRepresentation-PyTorch
Codebase and utilities for using models trained by multiple music related tasks
☆12 · Jul 6, 2023 · Updated 3 years ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
interactiveaudiolab / CAQE
Crowdsourced Audio Quality Evaluation Toolkit
☆55 · Dec 7, 2022 · Updated 3 years ago
MiniXC / phones
A collection of utilities for handling IPA phones.
☆27 · Sep 24, 2023 · Updated 2 years ago
juanjobosch / SourceFilterContoursMelody
Melody extraction based on source-filter modelling
☆27 · Apr 26, 2019 · Updated 7 years ago
MTG / Podcastmix
PodcastMix: a dataset for separating music and speech in podcasts.
☆44 · Aug 20, 2024 · Updated last year
faroit / python_audio_loading_benchmark
Benchmark popular audio i/o packages
☆152 · Dec 19, 2023 · Updated 2 years ago
mdangschat / speech-corpus-dl
Download and preparation tool for free speech corpora.
☆16 · Apr 28, 2019 · Updated 7 years ago
genisplaja / diffusion-vocal-sep
Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)
☆17 · Feb 16, 2023 · Updated 3 years ago
jordipons / source-separation-wavenet
A neural network for end-to-end music source separation
☆24 · Oct 31, 2018 · Updated 7 years ago
AASHISHAG / deepspeech-german
Automatic Speech Recognition (ASR) - German
☆321 · Feb 16, 2023 · Updated 3 years ago
interactiveaudiolab / voogle
Code for an audio search engine that uses vocal imitations of the desired sound
☆38 · May 16, 2023 · Updated 3 years ago
jeonchangbin49 / LimitAug
☆23 · Aug 30, 2022 · Updated 3 years ago
qiujiali / lattice-rescore
☆16 · Jun 13, 2022 · Updated 4 years ago
facebookresearch / tdfbanks
PyTorch implementation of time-domain filterbanks
☆113 · Sep 16, 2021 · Updated 4 years ago
ws-choi / AMSS-Net
A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…
☆21 · Jul 4, 2021 · Updated 5 years ago
marytts / pavoque-data
PAVOQUE Corpus of Expressive Speech
☆12 · Aug 2, 2016 · Updated 9 years ago
wangyu / rethink-audio-fsl
Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)
☆43 · May 24, 2022 · Updated 4 years ago
iver56 / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,160 · Nov 24, 2025 · Updated 7 months ago
Voice-Privacy-Challenge / Voice-Privacy-Challenge-2020
Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf
☆63 · Jul 13, 2026 · Updated last week
linto-ai / pyrtstools
Tools for speech processing, keyword spotting
☆16 · Mar 11, 2020 · Updated 6 years ago