Paderbox: A collection of utilities for audio / speech processing
☆43Jul 21, 2025Updated 7 months ago
Alternatives and similar repositories for paderbox
Users that are interested in paderbox are comparing it to the libraries listed below
Sorting:
- A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an …☆72Updated this week
- ☆53May 15, 2025Updated 9 months ago
- A temporal module for PyTorch-ComplexTensor☆44Jun 28, 2024Updated last year
- Discriminative Training of VBx Diarization☆27Sep 23, 2024Updated last year
- Python loaders for many Real Room Impulse Response databases☆96Sep 30, 2024Updated last year
- lazy_dataset: Process large datasets as if it was an iterable.☆18Dec 1, 2025Updated 3 months ago
- Collection of EM algorithms for blind source separation of audio signals☆298May 19, 2025Updated 9 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆46Feb 6, 2025Updated last year
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Source code for Multi-resolution Common Fate Transform.☆12Jun 5, 2020Updated 5 years ago
- Code to reproduce the experiments in the paper "Fast and stable blind source separation with rank-1 updates" presented at ICASSP 2020.☆21Apr 14, 2020Updated 5 years ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆48Jun 3, 2020Updated 5 years ago
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Sep 21, 2021Updated 4 years ago
- An implementation of the Prism layer (https://arxiv.org/abs/2011.04823)☆12Nov 13, 2020Updated 5 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- Echo aware source separation☆13May 29, 2018Updated 7 years ago
- Room acoustic simulator with a SOFA file loader.☆23Sep 27, 2024Updated last year
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Dec 8, 2022Updated 3 years ago
- Python bindings for SoX, aiming to replicate a subset of the command line sox utility.☆56Mar 29, 2021Updated 4 years ago
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- An implementation of the Wav2Letter Speech-to-Text model using PyTorch.☆14Mar 8, 2023Updated 2 years ago
- CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning [Official PyTorch implementation]☆22Jun 12, 2025Updated 8 months ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- Based on https://github.com/fatchord/WaveRNN☆24May 3, 2020Updated 5 years ago
- 4 Hour cuSignal Tutorial - ICASSP 2021 Notebooks☆49Jun 7, 2021Updated 4 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition☆128Jun 7, 2024Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆44Aug 20, 2024Updated last year
- A fast implementation of bss_eval metrics for blind source separation☆144Sep 6, 2025Updated 5 months ago
- The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"☆60Apr 7, 2022Updated 3 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- Official PyTorch implementation of MVAE for audio source separation☆43Dec 21, 2022Updated 3 years ago
- ☆68Feb 15, 2021Updated 5 years ago
- A Python toolkit for sound source separation.☆166May 6, 2025Updated 9 months ago
- Da - ECHO - RetrievAl - daTasEt☆34Jul 7, 2024Updated last year
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Jul 25, 2024Updated last year
- PyTorch implementation of NVIDIA WaveGlow with constant memory cost.☆36Jan 28, 2023Updated 3 years ago