pariajm/e2e-asr-and-disfluency-removal-evaluator

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pariajm/e2e-asr-and-disfluency-removal-evaluator)

pariajm / e2e-asr-and-disfluency-removal-evaluator

A new metric for evaluating end-to-end speech recognition and disfluency removal systems

☆19

Alternatives and similar repositories for e2e-asr-and-disfluency-removal-evaluator

Users that are interested in e2e-asr-and-disfluency-removal-evaluator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pariajm / joint-disfluency-detector-and-parser
View on GitHub
Improving Disfluency Detection by Self-Training a Self-Attentive Model
☆49May 2, 2021Updated 5 years ago
pariajm / sharif-emotional-speech-dataset
View on GitHub
A large-scale validated database for Persian speech emotion detection.
☆25May 9, 2022Updated 4 years ago
pariajm / awesome-disfluency-detection
View on GitHub
A curated list of awesome disfluency detection publications along with the released code and bibliographical information
☆85May 2, 2021Updated 5 years ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
Liangzheng-ZL / BEdit-TTS
View on GitHub
Speech samples and code of BEdit-TTS
☆34Oct 8, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
pariajm / english-fisher-annotations
View on GitHub
A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset
☆13May 2, 2021Updated 5 years ago
amritkromana / disfluency_detection_from_audio
View on GitHub
☆35Aug 22, 2024Updated last year
vickyzayats / switchboard_corrected_reannotated
View on GitHub
Automatic Mapping of Disfluency Annotations for corrected version of Switchboard
☆17Sep 27, 2019Updated 6 years ago
DanielLin94144 / Test-time-adaptation-ASR-SUTA
View on GitHub
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…
☆23Apr 1, 2022Updated 4 years ago
omidmnezami / pick-object-attack
View on GitHub
Type-Specific Adversarial Attack for Object Detection
☆13Aug 27, 2021Updated 4 years ago
SALT-NLP / Disfluency-Generation-and-Detection
View on GitHub
Code for "Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection"
☆16Apr 25, 2022Updated 4 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
rossellhayes / ipa
View on GitHub
🗣️ Convert between phonetic alphabets
☆11Feb 7, 2022Updated 4 years ago
omidmnezami / Face-Cap
View on GitHub
Face-Cap: Image Captioning using Facial Expression Analysis
☆17Apr 16, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
changelinglab / prism
View on GitHub
A toolkit and benchmark for evaluating phonetic capabilities of speech models.
☆18Apr 10, 2026Updated 3 months ago
marc-moreaux / audioset_raw
View on GitHub
Download and create a tfreader for the audioset dataset
☆17Apr 16, 2020Updated 6 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
placebokkk / pyfst
View on GitHub
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17Apr 2, 2018Updated 8 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
audioku / meta-transfer-learning
View on GitHub
Implementation of meta-transfer-learning for ASR and LM (ACL 2020)
☆52Jul 30, 2020Updated 5 years ago
pariajm / deep-disfluency-detector
View on GitHub
Disfluency Detection using Auto-Correlational Neural Networks
☆47Dec 23, 2020Updated 5 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
HLTCHKUST / ASCEND
View on GitHub
ASCEND Chinese-English code-switching dataset
☆33Jul 12, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
audio-captioning / caption-evaluation-tools
View on GitHub
Tools for the evaluation of audio captioning.
☆19May 23, 2020Updated 6 years ago
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
sooftware / End-to-End-Speech-Recognition-Models
View on GitHub
PyTorch implementation of automatic speech recognition models.
☆38Jan 10, 2021Updated 5 years ago
emonosuke / emoASR
View on GitHub
End-to-end MOdeling of ASR (Automatic Speech Recognition)
☆33Feb 16, 2023Updated 3 years ago
facebookresearch / fbai-speech
View on GitHub
Repo for the FB AI Speech team.
☆27Aug 24, 2021Updated 4 years ago
TeaPoly / warp-ctc-crf
View on GitHub
An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.
☆12Jul 5, 2021Updated 5 years ago
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
emirdemirel / DALI-TestSet4ALT
View on GitHub
This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.
☆12Nov 30, 2021Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
zyascend / End-to-End-Speech-Recognition-Learning
View on GitHub
ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别
☆12Oct 25, 2020Updated 5 years ago
thu-spmi / SPMILM
View on GitHub
A SPMI Lab toolkit for language models.
☆11Apr 12, 2017Updated 9 years ago
naka-lab / HDP-GP-HSMM
View on GitHub
☆11Apr 23, 2024Updated 2 years ago
ondrejklejch / acoustic_punctuation
View on GitHub
NMT based punctuation prediction system using lexical and acoustic features .
☆14Mar 30, 2020Updated 6 years ago
YounesAbounaceur / Conferences_Recommender_System
View on GitHub
For all the researchers who after waiting for a long time, they get their research papers rejected by big international conferences. This…
☆10Dec 6, 2020Updated 5 years ago
zengzp0912 / SEAME-dev-set
View on GitHub
SEAME corpus two develop set
☆42Dec 5, 2019Updated 6 years ago
TartuNLP / tts_preprocess_et
View on GitHub
Estonian text-to-speech text normalization pipeline
☆14Dec 17, 2025Updated 7 months ago