mhussam-ai / StimulerVoiceXLinks
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆12Updated 2 years ago
Alternatives and similar repositories for StimulerVoiceX
Users that are interested in StimulerVoiceX are comparing it to the libraries listed below
Sorting:
- ☆23Updated 2 years ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆40Updated 7 months ago
- A simple voice conversion tool☆17Updated 3 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 11 months ago
- ☆21Updated last week
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 5 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated last year
- Supervoice Speaker Separation Network☆12Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆21Updated 3 weeks ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆28Updated this week
- ☆24Updated 2 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Updated 7 months ago
- Analysis of XLS-R for Speech Quality Assessment☆13Updated 5 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 8 months ago
- Codebase and project page for EDMSound☆34Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆63Updated last month
- speaker-disentangled speech linguistic content quantizer☆21Updated 4 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆25Updated 10 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 11 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated 3 weeks ago
- Talking Face Generation system☆19Updated last year
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆13Updated 7 months ago
- ☆16Updated last year
- 2019_ML_Course Singing Voice Conversion Using Cycle-GAN:VC2☆16Updated 4 years ago
- ☆10Updated 3 years ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆24Updated 2 years ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- ☆15Updated last year
- ☆47Updated 8 months ago