Python library for handling audio datasets.
☆138 · Jul 6, 2023 · Updated 2 years ago
Alternatives and similar repositories for audiomate
Users that are interested in audiomate are comparing it to the libraries listed below
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z… ☆32 · Apr 8, 2022 · Updated 3 years ago
- Python library for audio augmentation ☆85 · Jul 6, 2023 · Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- ☆17 · Apr 14, 2023 · Updated 2 years ago
- This is code for an audio search engine that uses vocal imitations of the desired sound ☆38 · May 16, 2023 · Updated 2 years ago
- Notebooks for the EPFL class "Computers and Music". ☆25 · Aug 20, 2021 · Updated 4 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorch ☆88 · Jul 25, 2024 · Updated last year
- ☆14 · Jun 12, 2015 · Updated 10 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels. ☆22 · Apr 8, 2021 · Updated 4 years ago
- Python library for downloading, loading & working with sound datasets ☆350 · Sep 23, 2025 · Updated 5 months ago
- Audio processing by using pytorch 1D convolution network ☆1,117 · Dec 7, 2025 · Updated 2 months ago
- Paderbox: A collection of utilities for audio / speech processing ☆43 · Jul 21, 2025 · Updated 7 months ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) ☆104 · Nov 26, 2022 · Updated 3 years ago
- Tutorial covering Open Source tools for Source Separation. ☆15 · Nov 12, 2021 · Updated 4 years ago
- A library for speech data augmentation in time-domain ☆683 · Aug 30, 2021 · Updated 4 years ago
- Crowdsourced Audio Quality Evaluation Toolkit ☆55 · Dec 7, 2022 · Updated 3 years ago
- PodcastMix: A dataset for separating music and speech in podcasts. ☆44 · Aug 20, 2024 · Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆23 · Aug 16, 2021 · Updated 4 years ago
- Benchmark popular audio i/o packages ☆151 · Dec 19, 2023 · Updated 2 years ago
- End-to-end ASR/LM implementation with PyTorch ☆594 · Aug 30, 2021 · Updated 4 years ago
- Zero-data (yet trainable) probabilistic fundamental frequency estimator. ☆19 · Jun 9, 2018 · Updated 7 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly. ☆16 · Jun 17, 2022 · Updated 3 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r… ☆12 · Nov 30, 2021 · Updated 4 years ago
- Interspeech 2019 tutorial materials ☆49 · Sep 26, 2019 · Updated 6 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ☆14 · Nov 16, 2020 · Updated 5 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37 · Mar 2, 2022 · Updated 4 years ago
- Frechet Audio Distance evaluation in PyTorch ☆36 · Jun 9, 2023 · Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach ☆20 · Aug 2, 2021 · Updated 4 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356 ☆82 · Feb 9, 2021 · Updated 5 years ago
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf ☆64 · Jul 6, 2023 · Updated 2 years ago
- ☆68 · Feb 15, 2021 · Updated 5 years ago
- ☆16 · Jun 13, 2022 · Updated 3 years ago
- Epoch-synchronous overlap-add (ESOLA) for time- and pitch-scale modification of speech signals. ☆22 · Jul 24, 2020 · Updated 5 years ago
- wake word spotting with kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- ☆76 · Mar 18, 2022 · Updated 3 years ago
- Da - ECHO - RetrievAl - daTasEt ☆34 · Jul 7, 2024 · Updated last year
- Code for paper submission under review. ☆35 · Oct 30, 2017 · Updated 8 years ago
- ☆13 · Oct 27, 2021 · Updated 4 years ago