sam-dev-coder / StimulerVoiceX
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for StimulerVoiceX
- Codebase and project page for EDMSound☆29Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆66Updated last week
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆45Updated 2 weeks ago
- ☆61Updated 7 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆31Updated 2 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆15Updated last month
- Misc. tools/scripts that I made to use for tortoise☆18Updated 3 months ago
- Uses machine learning to denoise audio containing speech☆29Updated 5 months ago
- ☆23Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆43Updated last month
- Finally, some decent sample sentences☆22Updated 11 months ago
- Demo for 2022 ICASSP☆64Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 3 months ago
- GPT for FACodec☆13Updated 7 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆11Updated 3 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆16Updated last week
- ☆48Updated last year
- ☆23Updated last year
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆33Updated 2 years ago
- Implementation of Emo-StarGAN☆46Updated 11 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆17Updated 2 months ago
- Use DTLN real time speech denoising model(https://github.com/breizhn/DTLN) in web.☆10Updated last year
- Official implementation of Self-Remixing☆11Updated 9 months ago
- iSeparate library for the SDX2023 challenge☆13Updated 11 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated 2 months ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆28Updated last year
- Zero-Shot Emotion Style Transfer☆37Updated 7 months ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆48Updated 3 weeks ago
- Analysis of XLS-R for Speech Quality Assessment☆11Updated 3 months ago