danpovey/k2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/danpovey/k2)

danpovey / k2

FSA/FST algorithms, intended to (eventually) be interoperable with PyTorch and similar

☆26

Alternatives and similar repositories for k2

Users that are interested in k2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

speechcatcher-asr / speechcatcher-data
View on GitHub
☆11Sep 5, 2025Updated 10 months ago
burrmill / burrmill
View on GitHub
BurrMill core
☆22Nov 2, 2021Updated 4 years ago
k2-fsa / k2
View on GitHub
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,348Jul 11, 2026Updated last week
pkufool / cppinyin
View on GitHub
Converting Chinese sentences into pinyin sequences, implemented in C++, very fast and easy to deploy.
☆23Jan 5, 2026Updated 6 months ago
open-speech / tf_kaldi_io
View on GitHub
A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.
☆40Nov 26, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
speechcatcher-asr / speechcatcher
View on GitHub
☆48Nov 2, 2025Updated 8 months ago
KarelVesely84 / kaldi-io-for-python
View on GitHub
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
☆378Jun 16, 2023Updated 3 years ago
grazder / samejs
View on GitHub
Streaming Audio Models Examples in JS
☆20Mar 29, 2024Updated 2 years ago
pigzach / MagicSpeechASR
View on GitHub
magicspeech competition recipe
☆18Jun 29, 2020Updated 6 years ago
sogouspeech / xvads
View on GitHub
eXtended Voice Activity Detection Splitter
☆16Nov 22, 2019Updated 6 years ago
lhotse-speech / lhotse
View on GitHub
Tools for handling multimodal data in machine learning projects.
☆1,143Jun 22, 2026Updated last month
BennoKrojer / reasoning-over-facts
View on GitHub
This repository contains code for the paper "Are Pretrained Language Models Symbolic Reasoners over Knowledge?"
☆13Mar 23, 2021Updated 5 years ago
danpovey / openfst
View on GitHub
Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.
☆13Mar 8, 2016Updated 10 years ago
jarfo / gcommands
View on GitHub
Speech Commands Recognition using end-to-end deep learning models in pytorch
☆28Oct 8, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mguner / audio_search
View on GitHub
Use speech_to_text for keyword search in audio files.
☆12May 5, 2021Updated 5 years ago
tli725 / JL-Corpus
View on GitHub
For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…
☆11Oct 29, 2018Updated 7 years ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
lucko515 / Speech-commands-recognition
View on GitHub
Recognizing common speech commands using Keras and Tensorflow.
☆10Dec 17, 2018Updated 7 years ago
rwightman / pytorch-commands
View on GitHub
Some PyTorch code for the Kaggle Speech Recognition Challenge
☆13Feb 7, 2019Updated 7 years ago
WingZLeung / TTDS
View on GitHub
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13Mar 15, 2025Updated last year
hongfeixue / StutteringSpeechChallenge
View on GitHub
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12Jun 11, 2024Updated 2 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
tdillon / android
View on GitHub
Publish Android to Google Play with Travis-CI
☆18Oct 21, 2016Updated 9 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
JRMeyer / common-voice-stats
View on GitHub
A living document for all things Common Voice.
☆14Jun 24, 2024Updated 2 years ago
KrishnaDN / BERTphone
View on GitHub
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Dec 10, 2020Updated 5 years ago
zhukaii / OS-TR
View on GitHub
☆10Sep 7, 2019Updated 6 years ago
espnet / warp-ctc
View on GitHub
Pytorch Bindings for warp-ctc maintained by ESPnet
☆17Feb 20, 2021Updated 5 years ago
lezasantaizi / Spoken-language-identification
View on GitHub
方言分类，pytorch
☆44Sep 25, 2018Updated 7 years ago
26hzhang / bert_classification
View on GitHub
Token and Sentence Level Classification with Google's BERT (TensorFlow)
☆10Jul 11, 2019Updated 7 years ago
zoisboukouvalas / pyiva
View on GitHub
Implementation of the independent vector analysis (IVA) algorithm using a multivariate Laplace prior
☆28Mar 27, 2021Updated 5 years ago
sil-ai / tts-singlish
View on GitHub
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11Jan 11, 2020Updated 6 years ago
houxingxing / Multi-Criteria-Active-Deep-Learning-for-Image-Classification
View on GitHub
Multi-Criteria Active Deep Learning for Image Classification
☆10Apr 14, 2019Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
k2-fsa / snowfall
View on GitHub
Moved to https://github.com/k2-fsa/icefall
☆146Oct 13, 2022Updated 3 years ago
jackyyy0228 / Chinese-ASR
View on GitHub
Chinese-ASR built on kaldi
☆14Jan 21, 2019Updated 7 years ago
RicherMans / Dcase2018_pooling
View on GitHub
Repo for our pooling approach on the DCASE2018 task4
☆16Jul 6, 2023Updated 3 years ago
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
skaae / DeepLearnToolbox
View on GitHub
Matlab/Octave toolbox for deep learning. Includes Deep Belief Nets, Stacked Autoencoders, Convolutional Neural Nets, Convolutional Autoen…
☆21Jun 23, 2014Updated 12 years ago
lucasondel / multilingual-bottleneck-features
View on GitHub
BUT Multilingual Bottleneck Features
☆15Mar 22, 2019Updated 7 years ago
AmirmohammadRostami / KeywordsSpotting-EfficientNet-A0
View on GitHub
EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting
☆23Jun 16, 2022Updated 4 years ago