goepfert / noise_reduction
Audio de-noiser using a convolutional neural network architecture built with TensorFlow.js
☆20 · Updated 2 years ago
Alternatives and similar repositories for noise_reduction
Users who are interested in noise_reduction are comparing it to the libraries listed below
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆12 · Updated 5 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Updated 2 years ago
- An even smaller speech recognizer / force aligner ☆35 · Updated 8 months ago
- IPA Phonemizer/Dephonemizer for 139 human languages ☆35 · Updated 2 weeks ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 · Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆12 · Updated 11 months ago
- Accelerate Whisper tasks such as transcription by parallelizing them with multiprocessing ☆25 · Updated 2 years ago
- Open TTS models, built for streaming on the edge ☆42 · Updated 5 months ago
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261] ☆27 · Updated 4 years ago
- Evaluation of STT models for the German language ☆15 · Updated 3 years ago
- StyleTTS2 + Vocos as a Decoder ☆13 · Updated 5 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆29 · Updated 2 years ago
- ☆40 · Updated this week
- Cantonese Text to Speech with VITS implementation ☆35 · Updated 2 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ☆47 · Updated 2 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation. ☆31 · Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation ☆15 · Updated last year
- Web app for keyword spotting using TensorFlow.js ☆73 · Updated 2 years ago
- A curated list of awesome voice activity detection ☆62 · Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆31 · Updated last year
- C++ version of pyannote audio speaker diarization pipeline ☆21 · Updated last year
- A package for NeuCodec: a 50 Hz, 0.8 kbps, 24 kHz audio codec. ☆66 · Updated 2 weeks ago
- Launch your speech synthesis within one minute. ☆12 · Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model. ☆40 · Updated 3 months ago
- A singing voice conversion project. ☆15 · Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆15 · Updated 3 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆130 · Updated 10 months ago
- ☆11 · Updated last week
- ☆29 · Updated last year
- A simple, but performant framework for mapping speech directly to categories and intents. ☆21 · Updated last year