mhussam-ai / StimulerVoiceXLinks
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆12Updated last year
Alternatives and similar repositories for StimulerVoiceX
Users that are interested in StimulerVoiceX are comparing it to the libraries listed below
Sorting:
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 5 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆21Updated this week
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆26Updated last month
- ☆24Updated last month
- speaker-disentangled speech linguistic content quantizer☆19Updated 3 months ago
- Analysis of XLS-R for Speech Quality Assessment☆13Updated 4 months ago
- ☆10Updated 7 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Updated 6 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last month
- A simple voice conversion tool☆17Updated 3 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- Official implementation of Self-Remixing☆14Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆24Updated 4 months ago
- Codebase and project page for EDMSound☆34Updated last year
- Supervoice Speaker Separation Network☆12Updated last year
- ☆23Updated 2 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆27Updated 4 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated 3 months ago
- GPT for FACodec☆13Updated last year
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Updated 3 weeks ago
- Official implementation for FlowSep☆52Updated 5 months ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆13Updated 6 months ago
- Implementation of Emo-StarGAN☆45Updated last year
- ☆35Updated last year
- Official PyTorch implementation of TTS Style Transfer☆23Updated 3 years ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆40Updated 6 months ago
- Finally, some decent sample sentences☆23Updated last year
- An official implementation of Style-Talker for Spoken Dialogue Generation☆17Updated 5 months ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆17Updated 4 months ago