jordicapde / stutter-formerLinks
StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disfluencies attenuated or eliminated.
☆19Updated 3 years ago
Alternatives and similar repositories for stutter-former
Users that are interested in stutter-former are comparing it to the libraries listed below
Sorting:
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Updated 10 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆111Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆20Updated last year
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110Updated 8 months ago
- Official Code for ParrotTTS☆58Updated last year
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Updated 10 months ago
- [TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…☆118Updated 5 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated 2 years ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆63Updated last month
- ☆106Updated 4 months ago
- All generative model in one for better TTS model☆74Updated last year
- ☆24Updated 9 months ago
- Zero-Shot Emotion Style Transfer☆49Updated 9 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Updated last year
- ☆44Updated last year
- Official release of StyleTalk dataset.☆72Updated last year
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆43Updated last year
- ☆59Updated 3 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆147Updated 8 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆38Updated 3 weeks ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Updated 6 months ago
- [Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a…☆69Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆96Updated last year
- An unofficial PyTorch implementation of VALL-E☆88Updated 6 months ago
- poorman's ar-dit tts☆45Updated last month
- The open source code for SimpleSpeech series☆145Updated last year
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆54Updated last year
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆51Updated last year
- ConMamba for Automatic Speech Recognition☆101Updated last year