placebokkk/ctc-asr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/placebokkk/ctc-asr)

placebokkk / ctc-asr

pytorch CTC implementation for ASR. Use eesen's fst decoder framework

☆10

Alternatives and similar repositories for ctc-asr

Users that are interested in ctc-asr are comparing it to the libraries listed below

Sorting:

placebokkk / pyfst
View on GitHub
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17Apr 2, 2018Updated 7 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆15Mar 26, 2022Updated 3 years ago
idiap / apam
View on GitHub
APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…
☆14Feb 15, 2021Updated 5 years ago
DanielMengLiu / DeepLip
View on GitHub
deep-learning based audio-visual lip bometrics
☆15May 9, 2023Updated 2 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 4 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 3 years ago
MU94W / TTS-Eval
View on GitHub
☆18Aug 9, 2018Updated 7 years ago
shanguanma / Aligners
View on GitHub
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20Sep 5, 2023Updated 2 years ago
YoungloLee / tf2-speech-recognition-transformer
View on GitHub
Tensorflow 2 Speech Recognition Code (Transformer)
☆25Jun 29, 2020Updated 5 years ago
JackSyu / Discriminative-Multi-modality-Speech-Recognition
View on GitHub
TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"
☆26Apr 27, 2022Updated 3 years ago
OrcusCZ / NNAcousticModeling
View on GitHub
☆24Sep 25, 2018Updated 7 years ago
jingyonghou / KWS_Max-pooling_RHE
View on GitHub
Mining effective negative training samples for keyword spotting (PyTorch)
☆65May 23, 2020Updated 5 years ago
idiap / kaldi-ivector
View on GitHub
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure
☆88Feb 23, 2018Updated 8 years ago
lifeiteng / TTS-TextAnalyzer
View on GitHub
TTS Text Analyzer
☆31Jul 20, 2023Updated 2 years ago
wenet-e2e / WeTextProcessing.deprecated
View on GitHub
☆61Jan 31, 2023Updated 3 years ago
usnistgov / F4DE
View on GitHub
Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…
☆25Jul 6, 2017Updated 8 years ago
hainan-xv / PASM
View on GitHub
Pronunciation-assisted Subword Modeling
☆31May 30, 2019Updated 6 years ago
PunkMale / OR-Gate
View on GitHub
Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.
☆11Oct 23, 2023Updated 2 years ago
soumimaiti / speechlmscore_tool
View on GitHub
☆32Nov 24, 2024Updated last year
guanlongzhao / ppg-gmm
View on GitHub
Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"
☆36Jan 15, 2020Updated 6 years ago
MatthewScholefield / noise-detector
View on GitHub
An ambient noise detector
☆10Aug 23, 2020Updated 5 years ago
nwpuaslp / kws_mia
View on GitHub
☆11Apr 20, 2020Updated 5 years ago
yangdongchao / SimpleSpeech
View on GitHub
The open source code for SimpleSpeech series
☆145Oct 8, 2024Updated last year
azraelkuan / repgan
View on GitHub
RepVgg + HiFiGAN
☆36Aug 10, 2022Updated 3 years ago
HawkAaron / RNN-Transducer
View on GitHub
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆139Jun 7, 2021Updated 4 years ago
csukuangfj / kaldilm
View on GitHub
Python wrapper for kaldi's arpa2fst
☆38Aug 27, 2025Updated 6 months ago
FreedomIntelligence / MTalk-Bench
View on GitHub
MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols
☆17Nov 19, 2025Updated 3 months ago
MiukkaZh / MGT
View on GitHub
Learning Domain-Invariant Transformation for Speaker Verification.
☆11Jun 13, 2023Updated 2 years ago
dqqcasia / st
View on GitHub
End-to-end Speech Translation
☆35Apr 12, 2021Updated 4 years ago
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
View on GitHub
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45Dec 1, 2021Updated 4 years ago
formiel / speech-translation
View on GitHub
Multilingual speech translation
☆41Apr 15, 2021Updated 4 years ago
shane-settle / neural-acoustic-word-embeddings
View on GitHub
☆45Apr 5, 2019Updated 6 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆14Oct 15, 2022Updated 3 years ago
monatis / asr-annotation-bot
View on GitHub
Simple Telegram bot to annotate and varify automatic speech recognition datasets
☆12Mar 30, 2021Updated 4 years ago
frederick0329 / Text-Classification-Benchmark
View on GitHub
☆11Aug 28, 2017Updated 8 years ago
alainray / causal_inference
View on GitHub
Repository for my studies of Causal Inference
☆10Dec 1, 2019Updated 6 years ago
ahmdtaha / TextureClassification_FilterBank
View on GitHub
This repos provides an MATLAB code implementation for the Statistical Approach to Texture Classification from Single Images paper by Varm…
☆12Jan 30, 2018Updated 8 years ago
dongfangzhizhu / ECViT
View on GitHub
This project is a PyTorch implementation of the paper "ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-s…
☆19Jun 12, 2025Updated 8 months ago
ClarkWang1214 / AR-VirtualGlassesTryOn
View on GitHub
AR-VirtualGlassesTryOn
☆13May 21, 2016Updated 9 years ago