mhussam-ai / StimulerVoiceXLinks
StimulerVoiceX is a denoising and speech enhancement system. It uses deep learning techniques to remove noise from speech signals and improve their quality and clarity. It can handle various types of noise, such as white noise, babble noise, or environmental noise. It can also enhance speech features, such as volume, pitch, or timbre.
☆12Updated last year
Alternatives and similar repositories for StimulerVoiceX
Users that are interested in StimulerVoiceX are comparing it to the libraries listed below
Sorting:
- Speech enhancement in noisy and reverberant environments using deep neural networks☆20Updated 2 months ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 4 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆25Updated 3 weeks ago
- A simple voice conversion tool☆17Updated 3 years ago
- speaker-disentangled speech linguistic content quantizer☆16Updated 2 months ago
- ☆24Updated last month
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated last week
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 9 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆12Updated 5 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆37Updated 6 months ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- ☆22Updated 3 years ago
- ☆10Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last week
- GPT for FACodec☆13Updated last year
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 9 months ago
- Analysis of XLS-R for Speech Quality Assessment☆13Updated 3 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Official Code for ParrotTTS☆51Updated 7 months ago
- Uses machine learning to denoise audio containing speech☆34Updated 11 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆38Updated this week
- Supervoice Speaker Separation Network☆12Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated 2 months ago
- Codebase and project page for EDMSound☆34Updated last year
- Use DTLN real time speech denoising model(https://github.com/breizhn/DTLN) in web.☆13Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inference☆23Updated 3 months ago
- Repository of published DNN speech separation recipes for a number of datasets☆12Updated last year
- Zero-Shot Emotion Style Transfer☆45Updated last month
- ☆67Updated last year
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆12Updated 10 months ago