flashlight/text

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/flashlight/text)

flashlight / text

Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.

☆78

Alternatives and similar repositories for text

Users that are interested in text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

flashlight / sequence
View on GitHub
Sequence algorithms for use in Flashlight.
☆14Jan 12, 2026Updated 6 months ago
nvidia-riva / riva-asrlib-decoder
View on GitHub
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
☆91Feb 18, 2025Updated last year
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
kensho-technologies / pyctcdecode
View on GitHub
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆469Jul 13, 2023Updated 3 years ago
k2-fsa / fast_rnnt
View on GitHub
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆149Aug 25, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jumon / whisper-punctuator
View on GitHub
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆120Feb 4, 2023Updated 3 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
NVIDIA / NeMo-text-processing
View on GitHub
NeMo text processing for ASR and TTS
☆484Updated this week
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
jfulponi / ScalingShinyUsingDocker
View on GitHub
Create a Docker image for your Shiny App and upload it to Google Cloud without dying trying.
☆13Aug 29, 2022Updated 3 years ago
besacier / ASR2022
View on GitHub
☆57Dec 19, 2022Updated 3 years ago
revdotcom / fstalign
View on GitHub
An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.
☆169May 12, 2026Updated 2 months ago
XapaJIaMnu / gLM
View on GitHub
A GPU language model, based on btree backed tries.
☆30Mar 6, 2018Updated 8 years ago
iamjanvijay / rnnt_decoder_cuda
View on GitHub
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
☆67Jan 7, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
revdotcom / speech-datasets
View on GitHub
Various speech datasets made available to the public
☆135May 29, 2026Updated last month
davispolito / Phase-Vocoder
View on GitHub
☆13Apr 10, 2020Updated 6 years ago
winlinvip / srs-k2
View on GitHub
Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC
☆20Apr 16, 2023Updated 3 years ago
farisalasmary / wav2vec2-kenlm
View on GitHub
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆74Oct 11, 2021Updated 4 years ago
menon92 / Bangla-Word2Vec
View on GitHub
Bangla word2vec using skipgram approach
☆16Jun 28, 2023Updated 3 years ago
csukuangfj / kaldilm
View on GitHub
Python wrapper for kaldi's arpa2fst
☆38Aug 27, 2025Updated 10 months ago
awni / future_speech
View on GitHub
The History of Speech Recognition to the Year 2030
☆13Aug 14, 2021Updated 4 years ago
danpovey / pocolm
View on GitHub
Small language toolkit for creation, interpolation and pruning of ARPA language models
☆92Aug 6, 2022Updated 3 years ago
hoangtuanvu / conformer_ocr
View on GitHub
Transformer OCR is a Optical Character Recognition tookit built for researchers working on both OCR for both Vietnamese and English. This…
☆10Dec 27, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yanggeng1995 / WaveRNN
View on GitHub
☆34Jul 16, 2019Updated 7 years ago
amazon-science / proteno
View on GitHub
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45May 25, 2021Updated 5 years ago
hhguo / WaveRNN
View on GitHub
Based on https://github.com/fatchord/WaveRNN
☆24May 3, 2020Updated 6 years ago
mlverse / safetensors
View on GitHub
Safetensors File Format
☆11Apr 27, 2026Updated 2 months ago
menon92 / BanglaText2Image
View on GitHub
Synthetic data generation for bangla OCR
☆18Dec 1, 2022Updated 3 years ago
yandex / agglomerative_clustering
View on GitHub
☆15Jun 10, 2020Updated 6 years ago
csukuangfj / kaldifeat
View on GitHub
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…
☆215Jul 10, 2026Updated last week
daanzu / wav2vec2_stt_python
View on GitHub
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆23Aug 16, 2021Updated 4 years ago
HLTCHKUST / ASCEND
View on GitHub
ASCEND Chinese-English code-switching dataset
☆33Jul 12, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
tencent-ailab / pika
View on GitHub
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
☆354Dec 25, 2020Updated 5 years ago
lhotse-speech / lhotse
View on GitHub
Tools for handling multimodal data in machine learning projects.
☆1,143Jun 22, 2026Updated 3 weeks ago
thu-spmi / CAT
View on GitHub
CAT is more than a CRF-based ASR toolkit: it provides a complete workflow for data-efficient end-to-end ASR, supporting CTC, CTC-CRF, RNN…
☆368Feb 5, 2026Updated 5 months ago
alumae / voxlingua107_sb
View on GitHub
VoxLingua107 recipe for SpeechBrain
☆13Jul 3, 2021Updated 5 years ago
k2-fsa / k2
View on GitHub
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,348Jul 11, 2026Updated last week
rhasspy / gruut
View on GitHub
A tokenizer, text cleaner, and phonemizer for many human languages.
☆330Nov 15, 2024Updated last year