Deep Learning systems for training and testing disfluency detection and related tasks on speech data.
☆61Apr 22, 2026Updated last week
Alternatives and similar repositories for deep_disfluency
Users that are interested in deep_disfluency are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆23Dec 8, 2022Updated 3 years ago
- Disfluency Detection using Auto-Correlational Neural Networks☆47Dec 23, 2020Updated 5 years ago
- ☆15Sep 2, 2017Updated 8 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆49May 2, 2021Updated 5 years ago
- Deep neural approach to Boundary and Disfluency Detection - Based on my Master's work☆19Jul 25, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Jun 6, 2023Updated 2 years ago
- ☆39Jan 18, 2021Updated 5 years ago
- Disfluency Detection, Removal & Correction: Increase Apparent Public Speaking Fluency By Speech Augmentation (ICASSP '19)☆16Apr 14, 2020Updated 6 years ago
- Latex template for CUHK PhD Thesis☆13Jun 29, 2025Updated 10 months ago
- ☆12Apr 18, 2021Updated 5 years ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆83May 2, 2021Updated 5 years ago
- A pronunciation trainer w/ Python.☆15Sep 28, 2025Updated 7 months ago
- ☆11Jul 14, 2023Updated 2 years ago
- paper notes on nlp/cv/rl/dl☆14May 15, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Script for converting kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 10 years ago
- A casual and simple ChatGPT Python script that can run using terminal (as long as you have an API). Support Azure API.☆20May 3, 2025Updated last year
- Siamese network for unsupervised speech representation learning☆11Oct 12, 2018Updated 7 years ago
- Calculates the Word Error Rate between two text files☆20Nov 10, 2022Updated 3 years ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆16Mar 19, 2021Updated 5 years ago
- Microsoft Speech Language Translation (MSLT) Corpus☆19Sep 18, 2017Updated 8 years ago
- A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain☆11Mar 13, 2021Updated 5 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 4 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Aug 3, 2023Updated 2 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- A semantic and technical analysis of musical scores based on Information Retrieval Principles☆15Oct 13, 2022Updated 3 years ago
- ☆21Dec 9, 2016Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Daily creative coding sketches for Genuary 2024☆16Feb 1, 2024Updated 2 years ago
- Keras Implementation and Experiments with Deep Recurrent Neural Networks for Source Separation☆18May 4, 2018Updated 8 years ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Sep 22, 2024Updated last year
- ☆15Jun 17, 2019Updated 6 years ago
- bilingual dictionary extractor from parallel corpora☆23Jul 3, 2014Updated 11 years ago
- TAXREF-LD: the French Linked Data Taxonomic Register☆11Jul 30, 2024Updated last year
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago