SWIG bindings for Kaldi I/O, built with Conda
☆15Dec 15, 2024Updated last year
Alternatives and similar repositories for pydrobert-kaldi
Users who are interested in pydrobert-kaldi are comparing it to the libraries listed below
- Script for converting kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- Kaldi code for doing DNN with tensorflow☆13Feb 8, 2016Updated 10 years ago
- THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c…☆34Apr 15, 2018Updated 7 years ago
- PyTorch utilities for ML, specifically speech☆13Jan 30, 2024Updated 2 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- Demo WebApp using Kaldi DNN engine to convert speech to text☆11Jun 12, 2016Updated 9 years ago
- Humphrey, E. J. "An Exploration of Deep Learning in Music Informatics." (2015) New York University.☆14Feb 23, 2016Updated 10 years ago
- NIST SPH File reader (e.g. for TEDLIUM Corpus)☆26May 2, 2020Updated 5 years ago
- This is the home directory of the speaker diarization module being developed for Heterogeneous News data at RedHen Labs as a GSoC project☆10Sep 11, 2015Updated 10 years ago
- readers that enable reading kaldi ark in tensorflow☆17Mar 7, 2018Updated 8 years ago
- DEPRECATED version of SoundFile☆14May 26, 2020Updated 5 years ago
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 4 years ago
- Open Source WFST-based Decoder Toolkit☆77Feb 11, 2016Updated 10 years ago
- DEPRECATED: research attempt to build e2e task oriented chatbot optimized over conversational data and content of DB (single table)☆11Sep 28, 2016Updated 9 years ago
- a music segmentation algorithm that I proposed and implemented as my undergraduate project. The basic function is: a song is loaded to th…☆16Apr 19, 2013Updated 12 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Oct 7, 2024Updated last year
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Aug 5, 2018Updated 7 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Mar 19, 2024Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- Phonetic and phonological vocoding platform☆17Nov 23, 2016Updated 9 years ago
- A simple toolkit for speaker segmentation and identification☆31Jun 15, 2013Updated 12 years ago
- scripts to align a given wave to its transcription using models trained by Kaldi☆36Aug 15, 2019Updated 6 years ago
- Proof of concept app that demonstrates use of KeenASR SDK in ObjC. WE ARE HIRING: https://keenresearch.com/careers.html☆70Feb 6, 2026Updated last month
- Deep Learning for Speech Recognition based on Theano☆15Jul 28, 2017Updated 8 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Feb 23, 2021Updated 5 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago
- A Docker image for the Kaldi speech recognition tool + training data from Pop Up Archive☆20Mar 12, 2019Updated 7 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 10 years ago
- ☆16Jan 18, 2018Updated 8 years ago
- Top level code to transcribe English audio/video files into text/subtitles☆21Jun 12, 2018Updated 7 years ago
- ☆15Jan 24, 2017Updated 9 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- Japanese translation of the official documentation (https://kivy.org/docs/) for Kivy, a Python GUI library☆12Feb 14, 2019Updated 7 years ago
- Extended speech recognition neural network based on Kaldi for reproducible research☆15Aug 28, 2015Updated 10 years ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Mar 11, 2021Updated 5 years ago
- CAMEL (Content-based Audio and Music Extraction Library) is an easy-to-use C++ framework developed for content-based audio and music anal…☆21Jun 21, 2013Updated 12 years ago