finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
☆41Oct 14, 2022Updated 3 years ago
Alternatives and similar repositories for carmel
Users that are interested in carmel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 9 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 11 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- ☆21Apr 4, 2015Updated 11 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Bilingual sentence aligner (Gale & Church, 1993)☆14Jan 8, 2026Updated 5 months ago
- ☆21Dec 9, 2016Updated 9 years ago
- CS224S Course Project☆14Jun 9, 2014Updated 12 years ago
- DMV/CCM implementation☆17Jul 14, 2016Updated 9 years ago
- Myanmar and Thai Language Resources☆10Jul 18, 2022Updated 3 years ago
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- Multi-lingual AudioCaps☆14Nov 20, 2023Updated 2 years ago
- A Python package for processing research with Minimalist grammars☆22Nov 13, 2021Updated 4 years ago
- ☆47May 22, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- ☆14Feb 1, 2024Updated 2 years ago
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs☆30Dec 15, 2014Updated 11 years ago
- ☆11Oct 13, 2019Updated 6 years ago
- hyp: hypergraphs toolkit☆31Jul 9, 2016Updated 9 years ago
- ☆10Aug 25, 2018Updated 7 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 5 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆15Sep 5, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Some tools for JSGF grammar expansion. Generate sentences from a JSGF Grammar. I originally wrote this over the course of a week, so I se…☆17Oct 6, 2025Updated 8 months ago
- The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)☆33Sep 22, 2025Updated 9 months ago
- Generalized Language Modeling toolkit☆53Jun 21, 2022Updated 4 years ago
- Proposed splits for the LREC Wikipron paper☆15Apr 7, 2020Updated 6 years ago
- Stack neural networks applied to hefty natural language tasks.☆15Dec 26, 2019Updated 6 years ago
- ☆11Mar 20, 2021Updated 5 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆71Jun 15, 2026Updated 2 weeks ago
- Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet pow…☆73Mar 1, 2024Updated 2 years ago
- Coqui STT (🐸STT) based forced alignment tool☆13Feb 24, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Extract, parse and populate templates from strings☆28Apr 4, 2019Updated 7 years ago
- ☆13Updated this week
- Barista is an open-source framework for concurrent speech processing.☆36Mar 19, 2014Updated 12 years ago
- Wiktionary parser tool for many language editions.☆54Aug 17, 2022Updated 3 years ago
- ☆28Jan 29, 2021Updated 5 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- X (weighted / probabilistic) Context-Free Grammars☆25Jan 30, 2024Updated 2 years ago