⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
☆37May 8, 2026Updated last month
Alternatives and similar repositories for fast-audiomentations
Users that are interested in fast-audiomentations are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Yet Another Config Library for C++☆10Sep 21, 2018Updated 7 years ago
- Header-Only Collection of Clustering Algorithms for C++☆63Apr 26, 2026Updated 2 months ago
- Simple implement of ECS on C++☆16May 29, 2018Updated 8 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- Base for building Figma plugins with React☆16Jul 20, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆46Jun 11, 2025Updated last year
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆14Aug 25, 2023Updated 2 years ago
- Simple fluid simulation right in your terminal☆49Mar 14, 2026Updated 3 months ago
- Streaming Vocos☆31Jun 10, 2025Updated last year
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆23Jun 7, 2025Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆45Jul 24, 2023Updated 2 years ago
- A Responsive Swipeable Carousel☆20Apr 15, 2014Updated 12 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Automatic gain control library☆15Jul 13, 2024Updated last year
- Python bindings for Wuffs the Library☆19Apr 5, 2025Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆38Jan 17, 2024Updated 2 years ago
- ☆24Jul 29, 2024Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆30Nov 12, 2025Updated 7 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated 2 years ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- ☆12Nov 7, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Fast Russian Text normalization for TTS using only RegEx.☆32Jun 17, 2026Updated last week
- ☆130Aug 19, 2024Updated last year
- ☆26Mar 20, 2024Updated 2 years ago
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆19Jul 17, 2023Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 7 months ago
- ☆14Jun 16, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆23Feb 2, 2022Updated 4 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆14Mar 11, 2025Updated last year
- Minimal, predictable, footgun-free config library.☆42May 28, 2026Updated last month
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆21Aug 20, 2024Updated last year
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆31Sep 20, 2025Updated 9 months ago
- ☆15Nov 11, 2024Updated last year
- speex aec kalman filter☆15Mar 17, 2024Updated 2 years ago