Benchmark popular audio i/o packages
☆151Dec 19, 2023Updated 2 years ago
Alternatives and similar repositories for python_audio_loading_benchmark
Users that are interested in python_audio_loading_benchmark are comparing it to the libraries listed below
Sorting:
- A test bed for updates and new features | pytorch/audio☆171May 17, 2020Updated 5 years ago
- Fast PyTorch based DSP for audio and 1D signals☆452Feb 17, 2025Updated last year
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Audio processing by using pytorch 1D convolution network☆1,117Dec 7, 2025Updated 2 months ago
- A library for speech data augmentation in time-domain☆682Aug 30, 2021Updated 4 years ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆38Dec 16, 2024Updated last year
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆80Jul 1, 2022Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆34Oct 30, 2020Updated 5 years ago
- Song Describer is a data collection platform for annotating music with textual descriptions.☆60Dec 3, 2024Updated last year
- An open-source Python library for audio time-scale modification.☆226Dec 20, 2023Updated 2 years ago
- Utilities for resampling and filtering audio data☆47Jan 9, 2020Updated 6 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Tool to aid in the creation of mashups☆19Apr 7, 2020Updated 5 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Aug 3, 2023Updated 2 years ago
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- museval - source separation evaluation tools for python☆232May 28, 2025Updated 9 months ago
- Tools for handling multimodal data in machine learning projects.☆1,114Updated this week
- ☆508Jun 25, 2024Updated last year
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- Python bindings for SoX, aiming to replicate a subset of the command line sox utility.☆56Mar 29, 2021Updated 4 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆213Aug 7, 2025Updated 6 months ago
- A repository for benchmarking neural vocoders by their quality and speed.☆212May 30, 2025Updated 9 months ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,135Nov 24, 2025Updated 3 months ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345May 15, 2024Updated last year
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆344Dec 25, 2020Updated 5 years ago
- ☆99Nov 25, 2021Updated 4 years ago
- Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm☆758Jan 4, 2026Updated last month
- Official code for Wav2Seq☆97Jul 19, 2022Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆37Jun 24, 2025Updated 8 months ago
- source code of "End-to-end Music Remastering System Using Self-supervised and Adversarial Training"☆47Sep 7, 2023Updated 2 years ago
- Collection of audio-focused loss functions in PyTorch☆851Jul 30, 2024Updated last year
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆140Sep 25, 2024Updated last year
- A system works on singing voice synthesis☆79Jan 11, 2023Updated 3 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- Coqui Inference Engine☆40Aug 3, 2021Updated 4 years ago
- A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.☆366Feb 16, 2026Updated last week