igormq/ctcdecode-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/igormq/ctcdecode-pytorch)

igormq / ctcdecode-pytorch

Python implementation of CTC beam search decoder + agnostic LM scorer

☆20

Alternatives and similar repositories for ctcdecode-pytorch

Users that are interested in ctcdecode-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huangyz0918 / kws-continual-learning
View on GitHub
[ICASSP'22] Continual Learning Benchmark for Spoken Keyword Spotting
☆17Jun 7, 2022Updated 4 years ago
jingyonghou / KWS_Max-pooling_RHE
View on GitHub
Mining effective negative training samples for keyword spotting (PyTorch)
☆66May 23, 2020Updated 6 years ago
eastonYi / end-to-end_asr_pytorch
View on GitHub
Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch
☆23Jul 28, 2020Updated 5 years ago
matthewfl / openfst-wrapper
View on GitHub
☆28Jan 29, 2021Updated 5 years ago
XapaJIaMnu / gLM
View on GitHub
A GPU language model, based on btree backed tries.
☆30Mar 6, 2018Updated 8 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
burrmill / burrmill
View on GitHub
BurrMill core
☆22Nov 2, 2021Updated 4 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 4 years ago
cornerfarmer / ctc_segmentation
View on GitHub
Segment a given audio into utterances using a trained end-to-end ASR model.
☆75Oct 9, 2020Updated 5 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
nglehuy / ctc_decoders
View on GitHub
Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model
☆24Oct 28, 2023Updated 2 years ago
homink / kaldi-asr.forced_decoding
View on GitHub
Perform the forced decoding with target transcription
☆11Sep 12, 2018Updated 7 years ago
uiuc-sst / asr24
View on GitHub
24-hour Automatic Speech Recognition
☆27Jun 4, 2021Updated 5 years ago
VKW2021 / kaldi-baseline
View on GitHub
kaldi cnn-tdnnf baseline
☆13Aug 31, 2021Updated 4 years ago
opendcd / opendcd
View on GitHub
Open Source WFST-based Decoder Toolkit
☆75Feb 11, 2016Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
stas6626 / IDRnd
View on GitHub
ID R&D Voice Antispoofing Challenge Solution
☆11Jul 27, 2019Updated 6 years ago
csukuangfj / optimized_transducer
View on GitHub
Memory efficient transducer loss computation
☆70Jun 10, 2022Updated 4 years ago
m-wiesner / nnet_pytorch
View on GitHub
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Jul 25, 2024Updated last year
diff7 / tts-king
View on GitHub
a repository for trainabale tts multi speaker
☆14Nov 28, 2021Updated 4 years ago
danpovey / kaldi_lm
View on GitHub
Old language modeling tool that's used in kaldi
☆17Apr 20, 2023Updated 3 years ago
sarangzambare / hey-siri
View on GitHub
This repository is for wake-word detection in speech using recurrent neural networks
☆18Feb 25, 2019Updated 7 years ago
TeaPoly / CE-OptimizedLoss
View on GitHub
Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…
☆25Oct 11, 2024Updated last year
42io / c_keyword_spotting
View on GitHub
Keyword Spotting suitable for embedded devices.
☆28Jun 22, 2020Updated 6 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Pelhans / ZASR_tensorflow
View on GitHub
Mandarin ASR system based on tensorflow
☆108Aug 20, 2018Updated 7 years ago
nwpuaslp / kws_mia
View on GitHub
☆11Apr 20, 2020Updated 6 years ago
TeaPoly / CTC-OptimizedLoss
View on GitHub
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆59Sep 6, 2023Updated 2 years ago
ms-dot-k / LRW_ID
View on GitHub
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10Oct 12, 2023Updated 2 years ago
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago
arysin / nlp_uk_api
View on GitHub
☆11Oct 19, 2024Updated last year
thu-spmi / SPMILM
View on GitHub
A SPMI Lab toolkit for language models.
☆11Apr 12, 2017Updated 9 years ago
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
BUTSpeechFIT / ASR-hybrid-decoding
View on GitHub
☆17Nov 25, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kensho-technologies / pyctcdecode
View on GitHub
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆469Jul 13, 2023Updated 3 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
aispeech-lab / w2v-cif-bert
View on GitHub
☆37Jun 28, 2021Updated 5 years ago
speechpro / cloud-python
View on GitHub
Python клиент API распознавания и синтеза речи Облака ЦРТ
☆11Dec 26, 2022Updated 3 years ago
domcross / german-stt-evaluation
View on GitHub
Evaluation of STT models for german language
☆16Jan 22, 2022Updated 4 years ago
turnerdan / joethecorpusrogan
View on GitHub
A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…
☆21Jan 26, 2020Updated 6 years ago
pigzach / MagicSpeechASR
View on GitHub
magicspeech competition recipe
☆18Jun 29, 2020Updated 6 years ago