goepfert / noise_reduction
Audio De-Noiser using a Convolutional Neural Network Architecture built with TensorFlow.js
☆20Updated last year
Alternatives and similar repositories for noise_reduction:
Users who are interested in noise_reduction are comparing it to the repositories listed below
- An even smaller speech recognizer / force aligner☆32Updated 2 months ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re…☆47Updated last year
- C++ version of pyannote audio speaker diarization pipeline☆20Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- Project of Singing Voice Conversion.☆14Updated last year
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated last year
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆12Updated 2 months ago
- Streaming Audio Models Examples in JS☆15Updated 10 months ago
- Simple PyTorch Denoisers for Waveform Audio☆34Updated 2 months ago
- ☆9Updated 4 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆11Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆20Updated this week
- ☆10Updated 3 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆15Updated 2 years ago
- ☆8Updated last year
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆12Updated 3 months ago
- A curated list of awesome voice activity detection☆37Updated 3 months ago
- A lightweight wrapper around https://github.com/facebookresearch/encodec that enables dynamic streamed reading, seeking, metadata and GPU…☆13Updated 9 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 6 months ago
- ☆21Updated last month
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 6 months ago
- Russian phonetic transcription☆9Updated last year
- Evaluation of STT models for the German language☆15Updated 3 years ago
- Use VITS and Opencpop to develop singing voice synthesis; different from VISinger.☆35Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated 11 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- ☆12Updated 6 months ago