Object-oriented handling of audio data, with GPU-powered augmentations, and more.
☆339Apr 1, 2025Updated 11 months ago
Alternatives and similar repositories for audiotools
Users that are interested in audiotools are comparing it to the libraries listed below
Sorting:
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.☆1,737Jan 26, 2026Updated last month
- AcademiCodec: An Open Source Audio Codec Model for Academic Research☆670Dec 27, 2023Updated 2 years ago
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis☆1,086Aug 7, 2024Updated last year
- Collection of audio-focused loss functions in PyTorch☆855Jul 30, 2024Updated last year
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate☆755Nov 19, 2024Updated last year
- Pytorch implementation of the CREPE pitch tracker☆511May 16, 2025Updated 10 months ago
- A Jupyter book accompanying the ISMIR 2023 tutorial Introduction to DIfferentiable Audio Synthesiser Programming☆62Jun 30, 2025Updated 8 months ago
- Official PyTorch implementation of BigVGAN (ICLR 2023)☆1,195Sep 5, 2024Updated last year
- Audio Codec Speech processing Universal PERformance Benchmark☆299Jan 8, 2026Updated 2 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- A lightweight library for Frechet Audio Distance calculation.☆312Feb 11, 2026Updated last month
- An Open-source Streaming High-fidelity Neural Audio Codec☆500Mar 4, 2025Updated last year
- FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener…☆442Jan 25, 2024Updated 2 years ago
- Audiogen Codec☆144Jul 9, 2024Updated last year
- Pitch Estimating Neural Networks (PENN)☆271Apr 2, 2025Updated 11 months ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆372Sep 3, 2024Updated last year
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.☆284Updated this week
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆159Jul 16, 2022Updated 3 years ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆650Jun 9, 2024Updated last year
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆130Jun 11, 2024Updated last year
- Keep track of big models in audio domain, including speech, singing, music etc.☆506Sep 26, 2024Updated last year
- A differentiable version of SPTK☆196Feb 26, 2026Updated 3 weeks ago
- The Open Source Code of UniAudio☆605Jul 22, 2024Updated last year
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- Simple package for binding functions to CLI or config files.☆47Aug 11, 2024Updated last year
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆151Feb 11, 2023Updated 3 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,137Nov 24, 2025Updated 3 months ago
- ☆179Oct 24, 2023Updated 2 years ago
- Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)☆283Oct 8, 2021Updated 4 years ago
- MOS score prediction by fine-tuned wav2vec2.0 model☆176Oct 20, 2022Updated 3 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆111Jan 18, 2024Updated 2 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆270Jul 29, 2023Updated 2 years ago
- Fast PyTorch based DSP for audio and 1D signals☆452Feb 17, 2025Updated last year
- The open source code for LLM-Codec☆145Aug 18, 2024Updated last year
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆136Feb 3, 2025Updated last year
- Differentiable audio signal processors in PyTorch☆287Dec 4, 2023Updated 2 years ago
- DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/☆408May 30, 2023Updated 2 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆45May 18, 2023Updated 2 years ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆121Jan 24, 2023Updated 3 years ago