hongfeixue/StutteringSpeechChallenge

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hongfeixue/StutteringSpeechChallenge)

hongfeixue / StutteringSpeechChallenge

SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge

☆12

Alternatives and similar repositories for StutteringSpeechChallenge

Users that are interested in StutteringSpeechChallenge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jordicapde / stutter-former
View on GitHub
StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disflue…
☆19Feb 10, 2023Updated 3 years ago
khannasarthak / Stuttered-Speech-recognition
View on GitHub
Final semester project on Stuttered Speech recognition
☆17Sep 29, 2017Updated 8 years ago
Sreyan88 / LipGER
View on GitHub
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19Jul 16, 2024Updated 2 years ago
rorizzz / YOLO-Stutter
View on GitHub
YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
☆21Mar 4, 2025Updated last year
apple / ml-stuttering-events-dataset
View on GitHub
☆111Feb 7, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
OSU-slatelab / LibriStutter
View on GitHub
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
☆11Mar 13, 2021Updated 5 years ago
bhavyaghai / Fluent
View on GitHub
Fluent is an AI Augmented Writing Tool that assists People who Stutter write scripts which they can speak fluently
☆18Aug 26, 2022Updated 3 years ago
snovvcrash / daf-generator
View on GitHub
Simple Delayed Auditory Feedback (DAF) generator. An anti-stuttering tool
☆13May 10, 2020Updated 6 years ago
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
rithiksachdev / PostASR-Correction-SLT2024
View on GitHub
☆18Jul 22, 2024Updated 2 years ago
Hypotheses-Paradise / UADF
View on GitHub
☆17May 5, 2024Updated 2 years ago
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
kaistmm / V2SFlow
View on GitHub
[ICASSP 2025] V2SFlow: Video-to-Speech Generation with Speech Decomposition and Rectified Flow
☆21Jun 3, 2025Updated last year
konverner / morpholog
View on GitHub
Morphological Parser for Russian is able to split words into morphemes: prefixes, roots, infixes and postfixes
☆17Sep 13, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
YYYYYHC / Measure_Cache_Size
View on GitHub
This is a simple implementation of Saavedra-Barrera's paper SAAVEDRA-BARRERA R H. CPU Performance Evaluation and Execution Time Predictio…
☆10Nov 23, 2021Updated 4 years ago
rhasspy / ipa2kaldi
View on GitHub
Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)
☆10Jun 2, 2021Updated 5 years ago
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 11 months ago
ASLP-lab / WenetSpeech-Wu-Repo
View on GitHub
A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations
☆171Feb 6, 2026Updated 5 months ago
mubingshen / MLC-SLM-Baseline
View on GitHub
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆51May 14, 2025Updated last year
vivraj17 / Detection-Of-Parkinson-s-Disesase-Using-Voice-Impairments-With-ML-and-LSTM
View on GitHub
Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…
☆12Apr 1, 2019Updated 7 years ago
BiSinger-SVS / BiSinger
View on GitHub
Bilingual Singing Voice Synthesis
☆18Mar 25, 2024Updated 2 years ago
tli725 / JL-Corpus
View on GitHub
For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…
☆11Oct 29, 2018Updated 7 years ago
circle-hit / Lens
View on GitHub
Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"
☆12Oct 15, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
xiquan-li / FineLAP
View on GitHub
[ACL 2026 Main] FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pre-training
☆36Apr 20, 2026Updated 3 months ago
NUS-HPC-AI-Lab / MoST
View on GitHub
MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts
☆33Jan 15, 2026Updated 6 months ago
aya015757881 / brainfuck_interpreter
View on GitHub
An interpreter in C for the language brainfuck.
☆11Apr 12, 2023Updated 3 years ago
kyegomez / AudioFlamingo
View on GitHub
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…
☆39Jan 27, 2025Updated last year
lesterphillip / serenade
View on GitHub
A Singing Style Conversion Framework Based On Audio Infilling
☆35Apr 28, 2025Updated last year
alanshaoTT / LAT-Audio-Repo
View on GitHub
☆28Apr 28, 2026Updated 3 months ago
jyhan03 / channel-decorrelation
View on GitHub
multi-channel target speech extraction with channel decorrelation and target speaker adaptation
☆27Feb 19, 2021Updated 5 years ago
kuan2jiu99 / audio-hallucination
View on GitHub
Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024
☆34Mar 14, 2025Updated last year
yangdongchao / Target-sound-event-detection
View on GitHub
The source code for target sound detection
☆15Feb 26, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
K-STMLab / SSL4PR
View on GitHub
This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…
☆12Dec 19, 2025Updated 7 months ago
inverse-ai / FINALLY-Speech-Enhancement
View on GitHub
FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.
☆28Apr 1, 2026Updated 3 months ago
Labbeti / aac-metrics
View on GitHub
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
☆75Mar 22, 2026Updated 4 months ago
bfs18 / armel
View on GitHub
poorman's ar-dit tts
☆45Dec 31, 2025Updated 6 months ago
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
imadtoubal / Parkinson-s-Disease-Classification-from-Speech-Data
View on GitHub
Parkinson’s Disease Classification from Speech Data using multiple Machine Learning approaches. This was implemented using scikit-learn P…
☆14Feb 2, 2020Updated 6 years ago
Audio-Reasoning-Challenge / Audio-Reasoning-Challenge-Baselines
View on GitHub
The baselines of ARC-Challenge-Interspeech2026
☆60Dec 1, 2025Updated 7 months ago