goepfert / noise_reduction
Audio De-Noiser using a Convolutional Neural Network Architecture built with Tensorflow.js
★21 · Updated 2 years ago
Alternatives and similar repositories for noise_reduction
Users who are interested in noise_reduction are comparing it to the libraries listed below.
- SpeechPlus: Small LLM-Based Text-to-Speech Library ★20 · Updated 7 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ★13 · Updated 9 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ★12 · Updated 3 years ago
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261] ★26 · Updated 4 years ago
- IPA Phonemizer/Dephonemizer for 140 human languages ★50 · Updated last week
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ★17 · Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ★28 · Updated 2 years ago
- ★11 · Updated 4 months ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec. ★138 · Updated 3 months ago
- ★17 · Updated 4 years ago
- Transfer learning approach to pronunciation scoring ★11 · Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement ★16 · Updated last year
- On-device noise suppression powered by deep learning ★80 · Updated last week
- ★19 · Updated 10 months ago
- Open TTS models, built for streaming on the edge ★44 · Updated 9 months ago
- Evaluation of STT models for the German language ★15 · Updated 3 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ★46 · Updated 2 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ★15 · Updated 11 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ★13 · Updated 2 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23) ★10 · Updated last year
- StyleTTS2 + Vocos as a Decoder ★13 · Updated 9 months ago
- C++ version of pyannote audio overlapped speech detection pipeline ★13 · Updated last year
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement. ★19 · Updated last year
- Launch your speech synthesis within one minute. ★12 · Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ★13 · Updated last year
- Uses machine learning to denoise audio containing speech ★48 · Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147. ★20 · Updated last year
- Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ★14 · Updated 2 months ago
- Add n-gram and large language model (LLM) support to Whisper models. ★40 · Updated 8 months ago
- ★17 · Updated 2 years ago