Lallapallooza / fast-audiomentationsView external linksLinks
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
☆35Jan 19, 2024Updated 2 years ago
Alternatives and similar repositories for fast-audiomentations
Users that are interested in fast-audiomentations are comparing it to the libraries listed below
Sorting:
- Yet Another Config Library for C++☆10Sep 21, 2018Updated 7 years ago
- Header-Only Collection of Clustering Algorithms for C++☆63Jan 31, 2024Updated 2 years ago
- Simple implement of ECS on C++☆16May 29, 2018Updated 7 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆14Aug 25, 2023Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- ☆44Jun 11, 2025Updated 8 months ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆23Nov 12, 2025Updated 3 months ago
- ☆21Jul 29, 2024Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Base for building Figma plugins with React☆16Jul 20, 2022Updated 3 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 2 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Jul 24, 2023Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated 11 months ago
- Streaming Vocos☆29Jun 10, 2025Updated 8 months ago
- A Rust-based, SenseVoiceSmall☆23Jan 12, 2026Updated last month
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- ☆15Jul 14, 2020Updated 5 years ago
- ☆11Nov 7, 2024Updated last year
- ☆26Mar 20, 2024Updated last year
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- ☆15Nov 11, 2024Updated last year
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆27Sep 20, 2025Updated 4 months ago
- ☆11Mar 22, 2023Updated 2 years ago
- Neural model for prediction of stress position in Russian words☆12Jun 22, 2025Updated 7 months ago
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- Automatic gain control library☆15Jul 13, 2024Updated last year
- ☆27Oct 25, 2024Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 2 years ago