mhussam-ai / StimulerVoiceX
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆12 Updated 2 years ago
Alternatives and similar repositories for StimulerVoiceX
Users who are interested in StimulerVoiceX are comparing it to the libraries listed below.
- Run Ollama AI (Llama 3, Phi-3) on Colab/local & access via your custom domain using Cloudflare Tunnel. Easy guides for your personal, … ☆23 Updated last month
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 Updated last week
- Misc. tools/scripts that I made to use for tortoise ☆21 Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆68 Updated 2 weeks ago
- ☆14 Updated last year
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV) ☆20 Updated 5 years ago
- Make Kanye sing any song ya want ☆25 Updated 2 years ago
- ☆39 Updated last year
- ☆181 Updated 8 months ago
- Video restoration Processing Pipeline ☆33 Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆46 Updated 9 months ago
- Run Retrieval-based Voice Conversion training and inference with ease. ☆11 Updated 8 months ago
- Performs the entire AI cover generation process with UI ☆24 Updated last month
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated last year
- Official Implementation of StyleTTS-VC ☆191 Updated 8 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆102 Updated 9 months ago
- AudioLDM text to audio colab ☆19 Updated last year
- ☆10 Updated last year
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran… ☆41 Updated this week
- The official Implementation of PeriodWave and PeriodWave-Turbo ☆207 Updated 5 months ago
- Talking Face Generation system ☆19 Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆78 Updated 11 months ago
- TU Darmstadt - Deep Learning: Architectures & Methods Project SS21 ☆36 Updated 9 months ago
- Zero-Shot Emotion Style Transfer ☆49 Updated 5 months ago
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation ☆181 Updated last month
- an improved version of Real-time-voice-cloning ☆50 Updated last year
- Talking head animation ☆27 Updated last year
- optimized wav2lip ☆18 Updated last year
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription. ☆34 Updated 6 months ago
- Music production for silent film clips. ☆28 Updated 4 months ago