Lallapallooza / fast-audiomentationsLinks
β‘ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
β34Updated last year
Alternatives and similar repositories for fast-audiomentations
Users that are interested in fast-audiomentations are comparing it to the libraries listed below
Sorting:
- π΅ muse: Music Separationβ10Updated last year
- β56Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription usingβ¦β30Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inferenceβ27Updated 10 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β21Updated 6 months ago
- C++ version of pyannote audio overlapped speech detection pipelineβ13Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightningβ17Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.β15Updated 7 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.β27Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementβ16Updated last year
- Unofficial implementation of wavenext vocoderβ53Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arraysβ48Updated last month
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networksβ17Updated 2 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelationβ36Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into oneβ26Updated last year
- β29Updated 10 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.β32Updated 2 years ago
- Adaptive Vocoder for Custom Voiceβ61Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessmentβ16Updated 3 years ago
- Deep Speech Distances PyTorchβ29Updated 3 years ago
- β14Updated 4 months ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPAβ16Updated last year
- Official repository of Wavehax vocoderβ62Updated last week
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.β33Updated last month
- β23Updated last year
- β25Updated last year
- β11Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.β23Updated 4 months ago
- Supervoice diffusion enhanceβ27Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year