sam-dev-coder / StimulerVoiceX

StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning to remove noise from speech signals and improve their quality and clarity. It handles several types of noise, including white noise, babble noise, and environmental noise, and can also enhance speech features such as volume, pitch, and timbre.

☆10

Alternatives and similar repositories for StimulerVoiceX:

Users who are interested in StimulerVoiceX are comparing it to the libraries listed below.

sony / diffusion-timbre-transfer
☆33 Updated 2 months ago
philgzl / brever
Speech enhancement in noisy and reverberant environments using deep neural networks
☆20 Updated 3 months ago
mbrotos / SoundSeg
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation
☆12 Updated last month
kyegomez / SoundStream
Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"
☆12 Updated this week
fmiotello / fastVC
A simple voice conversion tool
☆17 Updated 2 years ago
eloimoliner / audio-inpainting-diffusion
☆62 Updated 9 months ago
vtuber-plan / hifi-gan
A high-resolution implementation of the HiFi-GAN vocoder for voice conversion.
☆30 Updated last year
AgentCooper2002 / EDMSound
Codebase and project page for EDMSound
☆33 Updated last year
jwr1995 / PubSep
Repository of published DNN speech separation recipes for a number of datasets
☆10 Updated last year
naba89 / iSeparate-SDX
iSeparate library for the SDX2023 challenge
☆13 Updated last year
uthree / ddsp-vocoder
☆10 Updated 2 months ago
JarodMica / tortoise_dataset_tools
Misc. tools/scripts that I made to use for tortoise
☆21 Updated 5 months ago
dmse4tts / DMSE4TTS
☆24 Updated last year
ryota-komatsu / speaker_disentangled_hubert
Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"
☆32 Updated last week
kohei0209 / self-remixing
Official implementation of Self-Remixing
☆13 Updated 11 months ago
anton-jeran / MULTI-AUDIODEC
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
☆44 Updated 4 months ago
ldzhangyx / MusicMagus
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆35 Updated 4 months ago
AI-S2-Lab / EmoPP
[NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech
☆22 Updated 5 months ago
clement-pages / gryannote
Provides custom Gradio components to make the diarization-based audio labeling process easier and faster.
☆53 Updated last week
iiscleap / ZEST
Zero-Shot Emotion Style Transfer
☆41 Updated 9 months ago
kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆17 Updated this week
WangHelin1997 / SoloAudio
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
☆74 Updated last month
jose-solorzano / audio-denoiser
Uses machine learning to denoise audio containing speech
☆31 Updated 7 months ago
freds0 / free-svc
[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion
☆39 Updated 2 weeks ago
eloimoliner / BABE2-music-restoration
☆38 Updated 9 months ago
justinjohn0306 / SpeedScribe
High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…
☆10 Updated 3 months ago
Sobsz / tts-dataset-prompts
Finally, some decent sample sentences
☆22 Updated last year
lcn-kul / xls-r-analysis-sqa
Analysis of XLS-R for Speech Quality Assessment
☆12 Updated 5 months ago
donghoney0416 / DeFTAN-II
Official page of "DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing", IEEE/ACM Transactions on Audio, Speech,…
☆12 Updated 2 months ago
AlekseyKorshuk / accompaniment-generator
Generates an accompaniment part with chords using an evolutionary algorithm.
☆8 Updated 2 years ago