⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
☆35Jan 19, 2024Updated 2 years ago
Alternatives and similar repositories for fast-audiomentations
Users who are interested in fast-audiomentations are comparing it to the libraries listed below.
- Yet Another Config Library for C++☆10Sep 21, 2018Updated 7 years ago
- Header-Only Collection of Clustering Algorithms for C++☆63Jan 31, 2024Updated 2 years ago
- Simple implementation of ECS in C++☆16May 29, 2018Updated 7 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆14Aug 25, 2023Updated 2 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 9 months ago
- ☆45Jun 11, 2025Updated 8 months ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆25Nov 12, 2025Updated 3 months ago
- ☆21Jul 29, 2024Updated last year
- A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Base for building Figma plugins with React☆16Jul 20, 2022Updated 3 years ago
- A Python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Russian phonetic transcription☆11Nov 19, 2025Updated 3 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Jul 24, 2023Updated 2 years ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A collection of all our phonemizers for dataset construction and inference☆28Feb 21, 2025Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- Speex AEC Kalman filter☆15Mar 17, 2024Updated last year
- ☆15Jul 14, 2020Updated 5 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆11Nov 7, 2024Updated last year
- ☆26Mar 20, 2024Updated last year
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- ☆15Nov 11, 2024Updated last year
- ☆11Mar 22, 2023Updated 2 years ago
- Evaluation of STT models for the German language☆15Jan 22, 2022Updated 4 years ago
- Automatic gain control library☆15Jul 13, 2024Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 4 years ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Sep 20, 2025Updated 5 months ago
- A Rust-based, SenseVoiceSmall☆27Updated this week
- ☆14Jun 16, 2023Updated 2 years ago
- A vocoder based on PC-DDSP and nsf-HiFiGAN☆18Jul 17, 2023Updated 2 years ago
- ☆27Oct 25, 2024Updated last year
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion.☆32Apr 10, 2023Updated 2 years ago