Audio processing by using pytorch 1D convolution network
☆1,124May 21, 2026Updated last week
Alternatives and similar repositories for nnAudio
Users that are interested in nnAudio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Collection of audio-focused loss functions in PyTorch☆866Jul 30, 2024Updated last year
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,148Nov 24, 2025Updated 6 months ago
- A library for speech data augmentation in time-domain☆687Aug 30, 2021Updated 4 years ago
- ☆511Jun 25, 2024Updated last year
- Fast PyTorch based DSP for audio and 1D signals☆456Feb 17, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of Differentiable Digital Signal Processing (DDSP) in Pytorch☆516Oct 28, 2023Updated 2 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,275Apr 13, 2026Updated last month
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆1,050Jul 5, 2023Updated 2 years ago
- A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.☆379Feb 16, 2026Updated 3 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,567May 13, 2026Updated 2 weeks ago
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM☆382Mar 24, 2023Updated 3 years ago
- Pytorch implementation of the CREPE pitch tracker☆515May 16, 2025Updated last year
- An STFT/iSTFT for PyTorch.☆371Oct 31, 2023Updated 2 years ago
- Python library for working with Music Information Retrieval datasets☆407Feb 6, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆526Mar 1, 2022Updated 4 years ago
- PyTorch Dataset for Speech and Music audio☆79Jul 12, 2024Updated last year
- Python library for downloading, loading & working with sound datasets☆355Sep 23, 2025Updated 8 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,879May 21, 2026Updated last week
- A test bed for updates and new features | pytorch/audio☆171May 17, 2020Updated 6 years ago
- A differentiable version of SPTK☆200May 18, 2026Updated last week
- ☆438Nov 1, 2023Updated 2 years ago
- Audio transformations library for PyTorch☆239Apr 19, 2022Updated 4 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,555Mar 12, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm☆771Jan 4, 2026Updated 4 months ago
- kapre: Keras Audio Preprocessors☆946May 17, 2026Updated last week
- Official implementation of SawSing (ISMIR'22)☆275Aug 28, 2022Updated 3 years ago
- Self-supervised learning for real-time pitch estimation☆291Oct 15, 2025Updated 7 months ago
- Python library for Room Impulse Response (RIR) simulation with GPU acceleration☆596Jul 18, 2025Updated 10 months ago
- Benchmark popular audio i/o packages☆152Dec 19, 2023Updated 2 years ago
- Open-Unmix - Music Source Separation for PyTorch☆1,485Jun 17, 2024Updated last year
- CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)☆1,383Aug 19, 2024Updated last year
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.☆1,805Jan 26, 2026Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Pitch Estimating Neural Networks (PENN)☆272Apr 2, 2025Updated last year
- Audio generation using diffusion models, in PyTorch.☆2,102Jun 12, 2023Updated 2 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆409Jul 7, 2021Updated 4 years ago
- Evaluation functions for music/audio information retrieval/signal processing algorithms.☆700Feb 19, 2026Updated 3 months ago
- ☆261Feb 14, 2024Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,640Apr 22, 2024Updated 2 years ago
- A flexible source separation library in Python☆646Dec 9, 2024Updated last year