goepfert / noise_reductionLinks
Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js
☆20Updated 2 years ago
Alternatives and similar repositories for noise_reduction
Users that are interested in noise_reduction are comparing it to the libraries listed below
Sorting:
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 3 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆31Updated this week
- Open TTS models, built for streaming on the edge☆43Updated 4 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- ☆27Updated last month
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆35Updated 2 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- An even smaller speech recognizer / force aligner☆35Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆28Updated last year
- Heteronym to Phoneme Parser☆18Updated last year
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆20Updated this week
- ☆17Updated 4 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Updated last year
- ☆18Updated 4 months ago
- Add n-gram and large language model (LLM) support to Whisper models.☆31Updated 3 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Updated 4 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 2 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Audio tokenization, in the fastest way possible!☆52Updated 11 months ago
- On-device noise suppression powered by deep learning☆73Updated 3 weeks ago
- Onnx compatible styletts2 code☆12Updated last month
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆30Updated last year
- StyleTTS 2 Optimized Training Fork☆33Updated 6 months ago
- High quality text-to-speech based on StyleTTS 2.☆57Updated this week
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.