SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12 · Jun 11, 2024 · Updated last year
Alternatives and similar repositories for StutteringSpeechChallenge
Users that are interested in StutteringSpeechChallenge are comparing it to the libraries listed below.
- Detecting and correcting dysfluencies/stuttering/stammering in audio files ☆10 · Apr 23, 2023 · Updated 2 years ago
- StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disflue… ☆19 · Feb 10, 2023 · Updated 3 years ago
- Final semester project on Stuttered Speech recognition ☆17 · Sep 29, 2017 · Updated 8 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition ☆19 · Jul 16, 2024 · Updated last year
- YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection ☆20 · Mar 4, 2025 · Updated last year
- A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain ☆11 · Mar 13, 2021 · Updated 5 years ago
- Fluent is an AI Augmented Writing Tool that assists People who Stutter write scripts which they can speak fluently ☆18 · Aug 26, 2022 · Updated 3 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆13 · Mar 11, 2025 · Updated last year
- Simple Delayed Auditory Feedback (DAF) generator. An anti-stuttering tool ☆13 · May 10, 2020 · Updated 5 years ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-… ☆50 · May 14, 2025 · Updated 11 months ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings. ☆26 · Apr 1, 2026 · Updated 2 weeks ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis ☆52 · Sep 20, 2025 · Updated 7 months ago
- A Singing Style Conversion Framework Based On Audio Infilling ☆33 · Apr 28, 2025 · Updated 11 months ago
- An interpreter in C for the language brainfuck. ☆11 · Apr 12, 2023 · Updated 3 years ago
- semantic tokenizer for speech and music ☆21 · Jul 6, 2025 · Updated 9 months ago
- A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations ☆131 · Feb 6, 2026 · Updated 2 months ago
- Morphological Parser for Russian is able to split words into morphemes: prefixes, roots, infixes and postfixes ☆17 · Sep 13, 2020 · Updated 5 years ago
- This is a simple implementation of Saavedra-Barrera's paper SAAVEDRA-BARRERA R H. CPU Performance Evaluation and Execution Time Predictio… ☆10 · Nov 23, 2021 · Updated 4 years ago
- [EMNLP 2025 Findings] A complete cross-modal RAG system for end-to-end speech-to-speech large models, including ASR-based Retrieval and E… ☆31 · Jul 11, 2025 · Updated 9 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation ☆27 · Feb 19, 2021 · Updated 5 years ago
- Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa… ☆24 · Aug 14, 2025 · Updated 8 months ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA) ☆10 · Jun 2, 2021 · Updated 4 years ago
- Bilingual Singing Voice Synthesis ☆18 · Mar 25, 2024 · Updated 2 years ago
- Official code for SongEcho ☆57 · Mar 3, 2026 · Updated last month
- [ICLR 2026] Data Pipeline, Models, and Benchmark for Omni-Captioner. ☆130 · Apr 7, 2026 · Updated last week
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use… ☆12 · Apr 1, 2019 · Updated 7 years ago
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast… ☆11 · Oct 29, 2018 · Updated 7 years ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024 ☆34 · Mar 14, 2025 · Updated last year
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial… ☆40 · Jan 27, 2025 · Updated last year
- Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models" ☆11 · Oct 15, 2024 · Updated last year
- To overcome the limitation and obtain more appropriate control filters, a generative fixed-filter active noise control (GFANC) approach i… ☆36 · Aug 28, 2025 · Updated 7 months ago
- The baselines of ARC-Challenge-Interspeech2026 ☆58 · Dec 1, 2025 · Updated 4 months ago
- The source code for target sound detection ☆15 · Feb 26, 2022 · Updated 4 years ago