mhussam-ai / StimulerVoiceX
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆10 Updated last year
Alternatives and similar repositories for StimulerVoiceX:
Users who are interested in StimulerVoiceX compare it to the repositories listed below:
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆19 Updated last week
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec" ☆12 Updated 2 months ago
- A simple voice conversion tool ☆17 Updated 3 years ago
- ☆24 Updated last year
- Uses machine learning to denoise audio containing speech ☆32 Updated 9 months ago
- ☆10 Updated 5 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆18 Updated last week
- Implementation of Emo-StarGAN ☆45 Updated last year
- GPT for FACodec ☆13 Updated last year
- ☆23 Updated last year
- Analysis of XLS-R for Speech Quality Assessment ☆13 Updated 2 months ago
- Codebase and project page for EDMSound ☆34 Updated last year
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa… ☆20 Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech ☆22 Updated 7 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation ☆17 Updated 3 months ago
- Zero-Shot Emotion Style Transfer ☆43 Updated last year
- Supervoice Speaker Separation Network ☆12 Updated 10 months ago
- ☆40 Updated 5 months ago
- Provides custom Gradio components to make the diarization-based audio labeling process easier and faster. ☆61 Updated this week
- ☆65 Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆19 Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 Updated 5 months ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis ☆23 Updated 3 weeks ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation ☆12 Updated 4 months ago
- AudioSR-Upsampling (any -> 48kHz) ☆40 Updated last year
- ☆13 Updated 7 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion ☆60 Updated 2 months ago
- Official implementation of Self-Remixing ☆13 Updated last year
- A high-resolution implementation of the HiFi-GAN Vocoder for Voice Conversion. ☆31 Updated 2 years ago
- Finally, some decent sample sentences ☆22 Updated last year