goepfert / noise_reductionLinks
Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js
☆21Updated 2 years ago
Alternatives and similar repositories for noise_reduction
Users that are interested in noise_reduction are comparing it to the libraries listed below
Sorting:
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 7 months ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆32Updated last year
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago
- ☆17Updated 4 years ago
- An even smaller speech recognizer / force aligner☆36Updated 11 months ago
- ☆21Updated this week
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated 2 years ago
- On-device noise suppression powered by deep learning☆77Updated last week
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last month
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Updated 8 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆15Updated 6 months ago
- ☆29Updated last year
- ☆16Updated 7 months ago
- Open TTS models, built for streaming on the edge☆44Updated 8 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated this week
- ☆19Updated 8 months ago
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Updated 4 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆47Updated last week
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 weeks ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 10 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆29Updated last year
- StyleTTS2 + Vocos as a Decoder☆13Updated 8 months ago
- Transfer learning approach to pronunciation scoring☆11Updated last year
- A curated list of awesome voice activity detection☆69Updated last year
- ☆13Updated 10 years ago
- ☆11Updated 3 months ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆122Updated last month