amritkromana/disfluency_detection_from_audio

☆35

Alternatives and similar repositories for disfluency_detection_from_audio

Users who are interested in disfluency_detection_from_audio are comparing it to the repositories listed below.

Fuann / hmamba
Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…
☆16 · May 6, 2025 · Updated last year
pariajm / awesome-disfluency-detection
A curated list of awesome disfluency detection publications along with the released code and bibliographical information
☆85 · May 2, 2021 · Updated 5 years ago
zelaki / DisfluentFA
A Weakly Supervised Forced Alignment for disfluent speech
☆15 · Nov 12, 2023 · Updated 2 years ago
pariajm / english-fisher-annotations
A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset
☆13 · May 2, 2021 · Updated 5 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
☆13 · Mar 23, 2026 · Updated 4 months ago
pariajm / e2e-asr-and-disfluency-removal-evaluator
A new metric for evaluating end-to-end speech recognition and disfluency removal systems
☆19 · Mar 7, 2021 · Updated 5 years ago
Sreyan88 / CompA
Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
☆23 · Jul 10, 2024 · Updated 2 years ago
hitwsl / transition_disfluency
☆15 · Sep 2, 2017 · Updated 8 years ago
johnmartinsson / differentiable-mel-spectrogram
The official implementation of DMEL, the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …
☆24 · Dec 21, 2024 · Updated last year
plnguyen2908 / UniTalk-ASD-code
[Interspeech 2026] Revisiting Active Speaker Detection: An In-the-Wild Benchmark for Generalization and Robustness
☆22 · Jun 25, 2026 · Updated last month
ZhaoZeyu1995 / BenNevis
A Differentiable WFST-based End-to-End Automatic Speech Recognition toolkit with flexible topology support
☆12 · Feb 15, 2026 · Updated 5 months ago
skececi / gptfree
Building or integrating an LLM wrapper shouldn't take more than 10 minutes.
☆13 · Feb 1, 2025 · Updated last year
IiuZiKai / Evo_TSE
☆17 · Apr 9, 2026 · Updated 3 months ago
MontrealCorpusTools / kalpy
Pybind11 bindings for Kaldi
☆15 · Jul 11, 2026 · Updated 2 weeks ago
JazminVidal / gop-ft
Transfer learning approach to pronunciation scoring
☆12 · Jan 17, 2024 · Updated 2 years ago
pariajm / joint-disfluency-detector-and-parser
Improving Disfluency Detection by Self-Training a Self-Attentive Model
☆49 · May 2, 2021 · Updated 5 years ago
Sreyan88 / LipGER
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19 · Jul 16, 2024 · Updated 2 years ago
OSU-slatelab / LibriStutter
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
☆11 · Mar 13, 2021 · Updated 5 years ago
rhss10 / joint-apa-mdd-mtl
Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…
☆25 · Nov 9, 2023 · Updated 2 years ago
aleXiehta / AD-FlowTSE
Adaptive Flow-Matching for Target Speaker Extraction
☆39 · Jul 13, 2026 · Updated 2 weeks ago
sildater / thegluenote
TheGlueNote is a representation model for note-wise music alignment.
☆14 · Jul 19, 2024 · Updated 2 years ago
pariajm / deep-disfluency-detector
Disfluency Detection using Auto-Correlational Neural Networks
☆47 · Dec 23, 2020 · Updated 5 years ago
Kikyo-16 / airgen
Official source code of airsep
☆39 · Mar 26, 2024 · Updated 2 years ago
kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12 · Mar 14, 2025 · Updated last year
frank613 / CTC-based-GOP
This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024
☆41 · Feb 5, 2026 · Updated 5 months ago
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21 · Sep 18, 2023 · Updated 2 years ago
samsad35 / code-ancogen
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14 · Mar 11, 2025 · Updated last year
carey-bunks / Jazz-Chord-Progressions-Corpus
Jazz chord progression corpus and code for evaluating harmonic similarity
☆18 · Oct 20, 2023 · Updated 2 years ago
doheejin / HiPAMA
This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…
☆40 · Apr 29, 2024 · Updated 2 years ago
PecholaL / MAIN-VC
Lightweight Speech Representation Learning for One-Shot Voice Conversion
☆23 · Dec 12, 2024 · Updated last year
salgado / music-search
Code from blog 'Searching by Music: Leveraging Vector Search for Music Information Retrieval'
☆16 · Nov 16, 2023 · Updated 2 years ago
rossellhayes / ipa
🗣️ Convert between phonetic alphabets
☆11 · Feb 7, 2022 · Updated 4 years ago
Hertin / WavPrompt
☆37 · Jun 30, 2022 · Updated 4 years ago
XZWY / MSLDM
Implementation of Multi-Source Music Generation with Latent Diffusion.
☆29 · Sep 12, 2024 · Updated last year
LetianLee / Speech-Emotion-Recognition
An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …
☆34 · May 18, 2022 · Updated 4 years ago
miyakei1225 / React-Hands-on-Stopwatch
This is the repository for 技育CAMP!
☆14 · May 21, 2025 · Updated last year
MasonPhonLab / MAPS
Mason-Alberta Phonetic Segmenter
☆15 · Feb 24, 2026 · Updated 5 months ago
dayanavivolab / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10 · Feb 29, 2024 · Updated 2 years ago
Louis0324 / DDSP-Articulatory-Vocoder
☆29 · Sep 5, 2024 · Updated last year