sdrobert/pydrobert-kaldi

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sdrobert/pydrobert-kaldi)

sdrobert / pydrobert-kaldi

SWIG bindings for Kaldi I/O, built with Conda

☆15

Alternatives and similar repositories for pydrobert-kaldi

Users that are interested in pydrobert-kaldi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dansoutner / kaldi2htk
View on GitHub
Script for converting kaldi GMM/HMM models to HTK format
☆11Jul 18, 2024Updated 2 years ago
psmit / kaldi-nnettf
View on GitHub
Kaldi code for doing DNN with tensorflow
☆13Feb 8, 2016Updated 10 years ago
mravanelli / theano-kaldi-rnn
View on GitHub
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c…
☆34Apr 15, 2018Updated 8 years ago
sdrobert / pydrobert-pytorch
View on GitHub
PyTorch utilities for ML, specifically speech
☆13Jan 30, 2024Updated 2 years ago
emsansone / GAN
View on GitHub
Tutorial on GANs
☆13Jul 9, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
dogancan / expected-edit-distance
View on GitHub
Expected edit distance implementation using OpenFst tools
☆11May 13, 2015Updated 11 years ago
kdavis-mozilla / iris
View on GitHub
Demo WebApp using Kaldi DNN engine to convert speech to text
☆11Jun 12, 2016Updated 10 years ago
GMMTeam / GMM
View on GitHub
SSE/GPU-accelerated training and evaluation of Gaussian Mixture Models (GMMs)
☆17Oct 2, 2014Updated 11 years ago
mcfletch / sphfile
View on GitHub
NIST SPH File reader (e.g. for TEDLIUM Corpus)
☆26May 2, 2020Updated 6 years ago
danieldimatteo / android-speech-diarization
View on GitHub
An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…
☆14Apr 12, 2021Updated 5 years ago
jfainberg / lattice_combination
View on GitHub
Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices
☆16Mar 19, 2024Updated 2 years ago
bastibe / PySoundFile
View on GitHub
DEPRECATED version of SoundFile
☆14May 26, 2020Updated 6 years ago
t13m / kaldi-readers-for-tensorflow
View on GitHub
readers that enable reading kaldi ark in tensorflow
☆17Mar 7, 2018Updated 8 years ago
bootphon / abkhazia
View on GitHub
ABX and kaldi experiments on speech corpora made easy
☆33Oct 7, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
opendcd / opendcd
View on GitHub
Open Source WFST-based Decoder Toolkit
☆75Feb 11, 2016Updated 10 years ago
torogmw / MusicSegmentation
View on GitHub
a music segmentation algorithm that I proposed and implemented as my undergraduate project. The basic function is: a song is loaded to th…
☆16Apr 19, 2013Updated 13 years ago
oplatek / e2end
View on GitHub
DEPRECATED: research attempt to build e2e task oriented chatbot optimized over conversational data and content of DB (single table)
☆11Sep 28, 2016Updated 9 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
AI-Guru / SincNet
View on GitHub
Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)
☆12Aug 5, 2018Updated 7 years ago
amirharati / kaldi-alligner
View on GitHub
scripts to align a given wave to its transcription using trained models by Kaldi
☆37Aug 15, 2019Updated 6 years ago
ndkgit339 / spe-dss
View on GitHub
Speech Parameter Estimation Using Differentiable Speech Synthesizer
☆43May 9, 2023Updated 3 years ago
idiap / phonvoc
View on GitHub
Phonetic and phonological vocoding platform
☆17Nov 23, 2016Updated 9 years ago
larsvers / Understanding-Zoom
View on GitHub
☆16Jan 18, 2018Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
gfdb / wav2aug
View on GitHub
A general purpose task-agnostic speech augmentation policy
☆16Mar 13, 2026Updated 4 months ago
keenresearch / keenasr-ios-poc
View on GitHub
Proof of concept app that demonstrates use of KeenASR SDK in ObjC. WE ARE HIRING: https://keenresearch.com/careers.html
☆70Jun 30, 2026Updated 2 weeks ago
ZhangAustin / Deep-Speech
View on GitHub
Deep Learning for Speech Recogntion based on Theano
☆15Jul 28, 2017Updated 8 years ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
cunnie / DiarTk
View on GitHub
A fork of Idiap Research Institute's DiarTk diarization toolkit
☆16Feb 20, 2016Updated 10 years ago
gooofy / py-kaldi-asr
View on GitHub
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
☆169Feb 23, 2021Updated 5 years ago
yashiro32 / speech_recognition
View on GitHub
Speech recognition
☆13Dec 27, 2014Updated 11 years ago
mozilla / murmur
View on GitHub
DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training
☆20May 23, 2019Updated 7 years ago
hipstas / kaldi-pop-up-archive
View on GitHub
A Docker image for the Kaldi speech recognition tool + training data from Pop Up Archive
☆20Mar 12, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
srvk / srvk-eesen-offline-transcriber
View on GitHub
Top level code to transcribe English audio/video files into text/subtitles
☆21Jun 12, 2018Updated 8 years ago
talonvoice / speech
View on GitHub
speech engine training projects
☆29Apr 19, 2021Updated 5 years ago
hlt-bme-hu / hunspeech
View on GitHub
☆14Jan 24, 2017Updated 9 years ago
CaydenPierce / MSA
View on GitHub
Open Source Wearable Microphone Array Glasses for Multi-Speaker Speech Recognition
☆18May 12, 2022Updated 4 years ago
naxingyu / kaldi-nn
View on GitHub
Extended speech recognition neural network based on Kaldi for reproducible research
☆15Aug 28, 2015Updated 10 years ago
rhasspy / phonetisaurus-pypi
View on GitHub
Python wrapper for phonetisaurus grapheme to phoneme tool
☆12Mar 11, 2021Updated 5 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago