AlexPeiris7/Dysfluency-detection-and-correction

Detecting and correcting dysfluencies/stuttering/stammering in audio files

☆10

Alternatives and similar repositories for Dysfluency-detection-and-correction

Users who are interested in Dysfluency-detection-and-correction are comparing it to the libraries listed below.

jordicapde / stutter-former
StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disflue…
☆19 · Feb 10, 2023 · Updated 3 years ago
hongfeixue / StutteringSpeechChallenge
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12 · Jun 11, 2024 · Updated 2 years ago
khannasarthak / Stuttered-Speech-recognition
Final semester project on Stuttered Speech recognition
☆17 · Sep 29, 2017 · Updated 8 years ago
bhavyaghai / Fluent
Fluent is an AI-augmented writing tool that helps people who stutter write scripts they can speak fluently
☆18 · Aug 26, 2022 · Updated 3 years ago
snovvcrash / daf-generator
Simple Delayed Auditory Feedback (DAF) generator. An anti-stuttering tool
☆13 · May 10, 2020 · Updated 6 years ago
superYong2020 / hajj_abnormal_behavior_detection
This is the code for the paper "Generative Adversarial Network Based Abnormal Behavior Detection in Massive Crowd Videos: A Hajj Case Study"
☆12 · Jun 8, 2021 · Updated 5 years ago
sagniklp / Disfluency-Removal-API
Disfluency Detection, Removal & Correction: Increase Apparent Public Speaking Fluency By Speech Augmentation (ICASSP '19)
☆16 · Apr 14, 2020 · Updated 6 years ago
BenTMatthews / StackAPIDemo
☆10 · Apr 4, 2023 · Updated 3 years ago
apple / ml-stuttering-events-dataset
☆113 · Feb 7, 2024 · Updated 2 years ago
th-nuernberg / ml-stuttering-events-dataset-extended
☆10 · Jun 8, 2022 · Updated 4 years ago
preethac / Software-related-Slack-Chats-with-Disentangled-Conversations
A Data Set of Software-related Developer Chat Conversations on Slack
☆21 · Apr 23, 2020 · Updated 6 years ago
konverner / morpholog
Morphological parser for Russian that splits words into morphemes: prefixes, roots, infixes and postfixes
☆17 · Sep 13, 2020 · Updated 5 years ago
YYYYYHC / Measure_Cache_Size
This is a simple implementation of Saavedra-Barrera's paper SAAVEDRA-BARRERA R H. CPU Performance Evaluation and Execution Time Predictio…
☆10 · Nov 23, 2021 · Updated 4 years ago
rorizzz / YOLO-Stutter
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
☆21 · Mar 4, 2025 · Updated last year
raihankhan-rk / AgentNet
☆30 · Feb 11, 2025 · Updated last year
circle-hit / Lens
Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"
☆12 · Oct 15, 2024 · Updated last year
mubingshen / MLC-SLM-Baseline
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆51 · May 14, 2025 · Updated last year
l3das / L3DAS21
☆37 · Jun 22, 2022 · Updated 4 years ago
nawarhalabi / festival-tts-arabic-voices-docker
A Docker image for a relatively lightweight full Arabic speech synthesis system
☆31 · Feb 12, 2021 · Updated 5 years ago
nafiuny / ICRCycleGAN-VC
Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny
☆15 · Apr 15, 2026 · Updated 2 months ago
zqs01 / data2vecnoisy
☆11 · Oct 20, 2022 · Updated 3 years ago
azmat21 / UyghurTextResource
Uyghur text resource crawled from websites
☆12 · Dec 25, 2015 · Updated 10 years ago
backspacetg / distilXLSR
Models and code for the INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13 · Mar 30, 2025 · Updated last year
liyunlongaaa / AD-TUNING
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11 · Feb 23, 2024 · Updated 2 years ago
lucadellalib / ts-asr
Target speaker automatic speech recognition (TS-ASR)
☆14 · Oct 14, 2023 · Updated 2 years ago
VKW2021 / kaldi-baseline
Kaldi CNN-TDNNF baseline
☆13 · Aug 31, 2021 · Updated 4 years ago
BUTSpeechFIT / hystoc
Getting confidences from any end-to-end system
☆11 · May 24, 2023 · Updated 3 years ago
Honee-W / U-SAM
Official repository for U-SAM (Interspeech 2025)
☆28 · Jun 3, 2025 · Updated last year
Sreyan88 / LipGER
Code for the Interspeech 2024 paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19 · Jul 16, 2024 · Updated last year
AV-Reasoner / AV-Reasoner
☆19 · Jul 22, 2025 · Updated 11 months ago
thuhcsi / Contextual-Biasing-Dataset
Open-source Mandarin biased word dataset
☆14 · Sep 21, 2023 · Updated 2 years ago
cai-cong / MER25_personality
☆21 · Jun 26, 2025 · Updated last year
fcwrwen / booksys
Online bookstore (SSM)
☆20 · Nov 15, 2018 · Updated 7 years ago
pashanitw / W2V2-BERT-ASR-Training
☆15 · Mar 25, 2024 · Updated 2 years ago
speechwellness / SpeechWellness-1_Baseline
☆11 · Feb 14, 2025 · Updated last year
the-bird-F / GLM-Voice-RAG
[EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E…
☆31 · Jul 11, 2025 · Updated 11 months ago
yichen14 / FastAdaSP
Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models" (EMNLP '24, Oral)
☆17 · Nov 14, 2024 · Updated last year
pengzhendong / audiolab
A streaming audio reader, processor, and writer built on top of soundfile and PyAV (bindings for FFmpeg)
☆39 · Mar 31, 2026 · Updated 3 months ago
OSU-slatelab / LibriStutter
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
☆11 · Mar 13, 2021 · Updated 5 years ago