mhussam-ai / StimulerVoiceX
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆10Updated last year
Alternatives and similar repositories for StimulerVoiceX:
Users that are interested in StimulerVoiceX are comparing it to the libraries listed below
- Speech enhancement in noisy and reverberant environments using deep neural networks☆20Updated this week
- Uses machine learning to denoise audio containing speech☆31Updated 7 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆12Updated 2 months ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 3 weeks ago
- Supervoice Speaker Separation Network☆12Updated 8 months ago
- ☆10Updated 3 months ago
- ☆24Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- ☆23Updated last year
- Finally, some decent sample sentences☆22Updated last year
- ☆39Updated 3 months ago
- Analysis of XLS-R for Speech Quality Assessment☆13Updated last week
- GPT for FACodec☆13Updated 10 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 6 months ago
- A simple voice conversion tool☆17Updated 2 years ago
- Official implementation of Self-Remixing☆13Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 6 months ago
- Official page of "DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing", IEEE/ACM Transactions on Audio, Speech,…☆12Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆57Updated last week
- Codebase and project page for EDMSound☆34Updated last year
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- Zero-Shot Emotion Style Transfer☆41Updated 10 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆32Updated 2 months ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆58Updated 6 months ago
- ☆63Updated 10 months ago
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa…☆19Updated last year