mhussam-ai / StimulerVoiceX
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆12 Updated 2 years ago
Alternatives and similar repositories for StimulerVoiceX
Users who are interested in StimulerVoiceX are comparing it to the libraries listed below.
- Misc. tools/scripts that I made to use for tortoise ☆21 Updated 11 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 Updated 9 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆41 Updated 8 months ago
- ☆14 Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆66 Updated 2 weeks ago
- ☆39 Updated last year
- A simple voice conversion tool ☆18 Updated 3 years ago
- Talking Face Generation system ☆19 Updated last year
- Uses machine learning to denoise audio containing speech ☆37 Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated last year
- Zero-Shot Emotion Style Transfer ☆49 Updated 3 months ago
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 Updated 2 years ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆29 Updated 2 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆16 Updated last year
- AudioLDM text to audio colab ☆19 Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆75 Updated 10 months ago
- Video restoration Processing Pipeline ☆31 Updated last year
- Implementation of Emo-StarGAN ☆45 Updated last year
- Performs the entire AI cover generation process with UI ☆22 Updated last week
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV) ☆20 Updated 5 years ago
- Official Implementation of StyleTTS-VC ☆188 Updated 7 months ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription. ☆34 Updated 5 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for … ☆13 Updated 10 months ago
- Music production for silent film clips. ☆26 Updated 3 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification. ☆22 Updated 4 months ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆26 Updated 2 months ago
- Talking head animation ☆27 Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆47 Updated 2 years ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ☆67 Updated 10 months ago
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos ☆22 Updated 10 months ago