Acoustic and language models for minorised languages.
☆26Sep 30, 2020Updated 5 years ago
Alternatives and similar repositories for cmusphinx-models
Users that are interested in cmusphinx-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- ☆10Dec 16, 2022Updated 3 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- ☆10May 22, 2023Updated 3 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 7 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Coqui STT (🐸STT) based forced alignment tool☆13Feb 24, 2022Updated 4 years ago
- ☆13Nov 16, 2022Updated 3 years ago
- Calculates the Word Error Rate between two text files☆20Nov 10, 2022Updated 3 years ago
- Tools for working with the CMU Pronunciation Dictionary☆36Sep 5, 2017Updated 8 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Mar 19, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The…☆10Apr 17, 2020Updated 6 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆14Oct 8, 2020Updated 5 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs☆31Dec 15, 2014Updated 11 years ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 5 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆14Jan 24, 2017Updated 9 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Mar 8, 2020Updated 6 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15May 19, 2020Updated 6 years ago
- ☆12Aug 2, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 5 years ago
- ☆23Apr 8, 2022Updated 4 years ago
- Deep Learning experiments☆19Nov 18, 2019Updated 6 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- ☆14Nov 13, 2022Updated 3 years ago
- ☆16Feb 9, 2024Updated 2 years ago
- An echo cancellation library for browsers using DTLN-aec☆26Oct 18, 2023Updated 2 years ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- A Python 2.7 implementation of Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW) algorithms for Automated Speech …☆17Apr 23, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Mar 7, 2019Updated 7 years ago
- Mispronunciation detection code for jingju singing voice☆19Sep 5, 2018Updated 7 years ago
- Counts frequencies of words using movie and television subtitles.☆20Jan 26, 2015Updated 11 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- Mirror of GlottHMM☆10Jun 7, 2016Updated 9 years ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆12Nov 26, 2019Updated 6 years ago