⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
☆36 · May 8, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for fast-audiomentations
Users that are interested in fast-audiomentations are comparing it to the libraries listed below.
- Yet Another Config Library for C++ ☆10 · Sep 21, 2018 · Updated 7 years ago
- Header-Only Collection of Clustering Algorithms for C++ ☆63 · Apr 26, 2026 · Updated last month
- Simple implementation of ECS in C++ ☆16 · May 29, 2018 · Updated 8 years ago
- 🎵 muse: Music Separation ☆11 · Feb 14, 2024 · Updated 2 years ago
- Base for building Figma plugins with React ☆16 · Jul 20, 2022 · Updated 3 years ago
- ☆45 · Jun 11, 2025 · Updated 11 months ago
- iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform ☆14 · Aug 25, 2023 · Updated 2 years ago
- Simple fluid simulation right in your terminal ☆49 · Mar 14, 2026 · Updated 2 months ago
- Streaming Vocos ☆31 · Jun 10, 2025 · Updated 11 months ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆23 · Nov 23, 2018 · Updated 7 years ago
- A python algorithm to change the pitch of the voice in real time ☆13 · Dec 13, 2020 · Updated 5 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆23 · Jun 7, 2025 · Updated 11 months ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis ☆45 · Jul 24, 2023 · Updated 2 years ago
- A Responsive Swipeable Carousel ☆20 · Apr 15, 2014 · Updated 12 years ago
- Automatic gain control library ☆15 · Jul 13, 2024 · Updated last year
- Python bindings for Wuffs the Library ☆19 · Apr 5, 2025 · Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation ☆38 · Jan 17, 2024 · Updated 2 years ago
- ☆23 · Jul 29, 2024 · Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation ☆28 · Nov 12, 2025 · Updated 6 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆16 · Jun 16, 2024 · Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments. ☆27 · Jan 21, 2025 · Updated last year
- ☆12 · Nov 7, 2024 · Updated last year
- Normalize Text in Russian ☆29 · Nov 7, 2023 · Updated 2 years ago
- ☆130 · Aug 19, 2024 · Updated last year
- ☆26 · Mar 20, 2024 · Updated 2 years ago
- A vocoder based on PC-DDSP and nsf-HiFiGAN ☆18 · Jul 17, 2023 · Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX. ☆13 · Oct 22, 2024 · Updated last year
- (WIP) Long-form speech generation ☆31 · Apr 2, 2025 · Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 · Nov 7, 2025 · Updated 6 months ago
- ☆14 · Jun 16, 2023 · Updated 2 years ago
- ☆23 · Feb 2, 2022 · Updated 4 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆14 · Mar 11, 2025 · Updated last year
- Minimal, predictable, footgun-free config library. ☆42 · Updated this week
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming ☆19 · Aug 20, 2024 · Updated last year
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ… ☆31 · Sep 20, 2025 · Updated 8 months ago
- ☆15 · Nov 11, 2024 · Updated last year
- speex aec kalman filter ☆15 · Mar 17, 2024 · Updated 2 years ago