Detecting and correction dysfluencies/stuttering/stammering in audio files
☆10Apr 23, 2023Updated 2 years ago
Alternatives and similar repositories for Dysfluency-detection-and-correction
Users that are interested in Dysfluency-detection-and-correction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disflue…☆19Feb 10, 2023Updated 3 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Jun 11, 2024Updated last year
- Final semester project on Stuttered Speech recognition☆17Sep 29, 2017Updated 8 years ago
- Fluent is an AI Augmented Writing Tool that assists People who Stutter write scripts which they can speak fluently☆18Aug 26, 2022Updated 3 years ago
- Simple Delayed Auditory Feedback (DAF) generator. An anti-stuttering tool☆13May 10, 2020Updated 5 years ago
- This the code of paper "Generative Adversarial Network Based Abnormal Behavior Detection in Massive Crowd Videos: A Hajj Case Study"☆10Jun 8, 2021Updated 4 years ago
- Disfluency Detection, Removal & Correction: Increase Apparent Public Speaking Fluency By Speech Augmentation (ICASSP '19)☆16Apr 14, 2020Updated 5 years ago
- ☆10Apr 4, 2023Updated 2 years ago
- ☆109Feb 7, 2024Updated 2 years ago
- ☆10Jun 8, 2022Updated 3 years ago
- A Data Set of Software-related Developer Chat Conversations on Slack☆20Apr 23, 2020Updated 5 years ago
- This is a simple implementation of Saavedra-Barrera's paper SAAVEDRA-BARRERA R H. CPU Performance Evaluation and Execution Time Predictio…☆10Nov 23, 2021Updated 4 years ago
- Morphological Parser for Russian is able to split words into morphemes: prefixes, roots, infixes and postfixes☆17Sep 13, 2020Updated 5 years ago
- YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection☆20Mar 4, 2025Updated last year
- ☆30Feb 11, 2025Updated last year
- Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"☆11Oct 15, 2024Updated last year
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆50May 14, 2025Updated 10 months ago
- ☆37Jun 22, 2022Updated 3 years ago
- A Docker image for a relatively light-weight full Arabic speech synthesis system☆31Feb 12, 2021Updated 5 years ago
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Oct 30, 2025Updated 4 months ago
- ☆11Oct 20, 2022Updated 3 years ago
- uyghur text resource crawled from website☆12Dec 25, 2015Updated 10 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- Getting confidences from any end-to-end systems☆11May 24, 2023Updated 2 years ago
- Official repository for U-SAM (Interspeech 2025)☆26Jun 3, 2025Updated 9 months ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- ☆19Jul 22, 2025Updated 8 months ago
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- ☆18Jun 26, 2025Updated 8 months ago
- [EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…☆28Jul 11, 2025Updated 8 months ago
- 网上书店ssm☆20Nov 15, 2018Updated 7 years ago
- ☆15Mar 25, 2024Updated 2 years ago
- ☆11Feb 14, 2025Updated last year
- X-Talk is an open-source full-duplex cascaded spoken dialogue system framework enabling low-latency, interruptible, and human-like speech…☆187Mar 18, 2026Updated last week
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆17Nov 14, 2024Updated last year
- A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain☆11Mar 13, 2021Updated 5 years ago