mhussam-ai / StimulerVoiceX
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆12 Updated 2 years ago
Alternatives and similar repositories for StimulerVoiceX
Users who are interested in StimulerVoiceX are comparing it to the libraries listed below.
- Misc. tools/scripts that I made to use for tortoise ☆21 Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 Updated 10 months ago
- ☆14 Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆42 Updated 9 months ago
- ☆39 Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification. ☆23 Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆67 Updated last month
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and uses CUDA for … ☆13 Updated 11 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ☆68 Updated 10 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated last year
- Performs the entire AI cover generation process with UI ☆23 Updated last month
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆29 Updated 2 months ago
- Uses machine learning to denoise audio containing speech ☆39 Updated last year
- Awesome music generation model——MG² ☆159 Updated 5 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆78 Updated 11 months ago
- VALL-E 2 reproduction ☆129 Updated last year
- Official Implementation of StyleTTS-VC ☆190 Updated 7 months ago
- AudioLDM text to audio colab ☆19 Updated last year
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation ☆177 Updated last month
- Zero-Shot Emotion Style Transfer ☆49 Updated 4 months ago
- The official Implementation of PeriodWave and PeriodWave-Turbo ☆206 Updated 4 months ago
- Site for sharing MusicGen + AudioGen Prompts and Creations ☆47 Updated 5 months ago
- ☆181 Updated 8 months ago
- Run Retrieval-based Voice Conversion training and inference with ease. ☆11 Updated 7 months ago
- An unofficial PyTorch implementation of VALL-E ☆88 Updated last month
- ☆116 Updated 6 months ago
- Prepare spectrograms from audio for training a Riffusion model ☆16 Updated 2 years ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS" ☆100 Updated 2 months ago
- Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr… ☆225 Updated last year
- A simple voice conversion tool ☆18 Updated 3 years ago