mhussam-ai / StimulerVoiceXLinks
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆12Updated 2 years ago
Alternatives and similar repositories for StimulerVoiceX
Users that are interested in StimulerVoiceX are comparing it to the libraries listed below
Sorting:
- Misc. tools/scripts that I made to use for tortoise☆21Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated last month
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated 2 weeks ago
- ☆40Updated last year
- ☆14Updated last year
- Talking Face Generation system☆19Updated 2 years ago
- ☆10Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆29Updated 5 months ago
- Run Retrieval-based Voice Conversion training and inference with ease.☆11Updated 9 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆47Updated 11 months ago
- Zero-Shot Emotion Style Transfer☆49Updated 6 months ago
- A simple voice conversion tool☆19Updated 3 years ago
- create dataset from list of youtube links easily☆21Updated 2 years ago
- Analysis of XLS-R for Speech Quality Assessment☆14Updated 8 months ago
- Uses machine learning to denoise audio containing speech☆44Updated last year
- Your one-stop solution for voice dataset creation☆127Updated last year
- Performs the entire AI cover generation process with UI☆25Updated 3 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation☆192Updated 3 months ago
- Engage in conversation with your virtual self using AI techniques like NLP, voice cloning, and computer vision. Get accurate answers with…☆84Updated 2 years ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆24Updated 7 months ago
- Implementation of Emo-StarGAN☆45Updated last year
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆102Updated 4 months ago
- RVC Onnx Infer- Upgraded and simplified-ish☆23Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆80Updated last year
- This repository contains code for fine-tuning the Whisper speech-to-text model.☆16Updated 2 weeks ago
- Official Implementation of StyleTTS-VC☆191Updated 9 months ago
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…☆111Updated 2 weeks ago
- An unofficial PyTorch implementation of VALL-E☆88Updated 3 months ago
- Make Kanye sing any song ya want 🎤🔥☆25Updated 2 years ago