sam-dev-coder / StimulerVoiceX
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆10Updated last year
Alternatives and similar repositories for StimulerVoiceX:
Users that are interested in StimulerVoiceX are comparing it to the libraries listed below
- ☆33Updated 2 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆20Updated 3 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆12Updated last month
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated this week
- A simple voice conversion tool☆17Updated 2 years ago
- ☆62Updated 9 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- Codebase and project page for EDMSound☆33Updated last year
- Repository of published DNN speech separation recipes for a number of datasets☆10Updated last year
- iSeparate library for the SDX2023 challenge☆13Updated last year
- ☆10Updated 2 months ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 5 months ago
- ☆24Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆32Updated last week
- Official implementation of Self-Remixing☆13Updated 11 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆44Updated 4 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆35Updated 4 months ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆53Updated last week
- Zero-Shot Emotion Style Transfer☆41Updated 9 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆17Updated this week
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆74Updated last month
- Uses machine learning to denoise audio containing speech☆31Updated 7 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆39Updated 2 weeks ago
- ☆38Updated 9 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 months ago
- Finally, some decent sample sentences☆22Updated last year
- Analysis of XLS-R for Speech Quality Assessment☆12Updated 5 months ago
- Official page of "DeFTAN-II: Efficient multichannel speech enhancement with subgroup processing", IEEE/ACM Transactions on Audio, Speech,…☆12Updated 2 months ago
- Generate accompaniment part with chords using Evolutionary algorithm.☆8Updated 2 years ago