sam-dev-coder / StimulerVoiceX
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for StimulerVoiceX
- A simple voice conversion tool☆15Updated 2 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆15Updated last month
- ☆23Updated last year
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆17Updated last week
- Analysis of XLS-R for Speech Quality Assessment☆11Updated 3 months ago
- Finally, some decent sample sentences☆22Updated 11 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated 6 months ago
- Codebase and project page for EDMSound☆29Updated 11 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆11Updated 3 months ago
- Implementation of Emo-StarGAN☆46Updated 10 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- ☆23Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 2 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆62Updated 2 weeks ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆26Updated last year
- ☆61Updated 7 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated this week
- ☆19Updated 2 weeks ago
- Uses machine learning to denoise audio containing speech☆29Updated 4 months ago
- Diffusion Model for Voice Conversion☆38Updated 8 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆45Updated last week
- Demo for 2022 ICASSP☆64Updated 2 years ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆28Updated 3 weeks ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆46Updated last year
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆33Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 6 months ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆50Updated 2 years ago
- Official implementation of Self-Remixing☆11Updated 9 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆16Updated 2 months ago