goepfert / noise_reduction
Audio de-noiser using a convolutional neural network architecture, built with TensorFlow.js
☆21 · Updated 2 years ago
Alternatives and similar repositories for noise_reduction
Users who are interested in noise_reduction are comparing it to the repositories listed below
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Updated 2 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀 ☆17 · Updated 7 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆13 · Updated 8 months ago
- A package for NeuCodec: a 50 Hz, 0.8 kbps, 24 kHz audio codec. ☆135 · Updated 2 months ago
- StyleTTS2 + Vocos as a Decoder ☆13 · Updated 9 months ago
- Open TTS models, built for streaming on the edge ☆44 · Updated 9 months ago
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261] ☆27 · Updated 4 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆14 · Updated last month
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Updated 2 years ago
- Project of Singing Voice Conversion. ☆15 · Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆15 · Updated 7 months ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆27 · Updated last year
- ☆11 · Updated 3 months ago
- IPA Phonemizer/Dephonemizer for 144 human languages ☆49 · Updated this week
- A neural vocoder supporting Ring Attention, Conformer and NSF. ☆23 · Updated 4 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆19 · Updated this week
- OCTRA is a web-application for the orthographic transcription of audio files. ☆39 · Updated 2 weeks ago
- ☆17 · Updated 4 years ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks. ☆34 · Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 · Updated 2 years ago
- ☆16 · Updated 8 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Updated last year
- ☆19 · Updated 9 months ago
- Audio tokenization, in the fastest way possible! ☆53 · Updated last year
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for editing and controlling the target language's phoneme… ☆22 · Updated 4 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆25 · Updated last year
- Tunable pipelines ☆40 · Updated 3 months ago
- Transfer learning approach to pronunciation scoring ☆11 · Updated last year
- A curated list of awesome voice activity detection ☆70 · Updated last year
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no Python package dependencies ☆15 · Updated last year