finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
☆41Oct 14, 2022Updated 3 years ago
Alternatives and similar repositories for carmel
Users that are interested in carmel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15☆12Apr 17, 2017Updated 9 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 11 years ago
- Source code for "Unsupervised Lexicon Discovery from Acoustic Input ", Lee et al, 2015 TACL☆10Aug 11, 2016Updated 9 years ago
- ☆21Apr 4, 2015Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆21Dec 9, 2016Updated 9 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆14Jan 24, 2017Updated 9 years ago
- CS224S Course Project☆14Jun 9, 2014Updated 11 years ago
- DMV/CCM implementation☆17Jul 14, 2016Updated 9 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- Myanmar and Thai Language Resources☆10Jul 18, 2022Updated 3 years ago
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…☆12Feb 25, 2025Updated last year
- Multi-lingual AudioCaps☆14Nov 20, 2023Updated 2 years ago
- ☆47May 22, 2017Updated 9 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- C++ implementation of the Hellinger PCA for computing word embeddings.☆32Nov 11, 2016Updated 9 years ago
- ☆14Feb 1, 2024Updated 2 years ago
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs☆31Dec 15, 2014Updated 11 years ago
- hyp: hypergraphs toolkit☆31Jul 9, 2016Updated 9 years ago
- ☆10Aug 25, 2018Updated 7 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- Dennis Klatt's speech synthesis system, updated with a Python interface.☆31Jun 23, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- HTK Toolkit with Linux 64 bit and Docker support☆20Oct 4, 2021Updated 4 years ago
- Humphrey, E. J. "An Exploration of Deep Learning in Music Informatics." (2015) New York University.☆14Feb 23, 2016Updated 10 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)☆33Sep 22, 2025Updated 8 months ago
- Generalized Language Modeling toolkit☆52Jun 21, 2022Updated 3 years ago
- Proposed splits for the LREC Wikipron paper☆15Apr 7, 2020Updated 6 years ago
- Stack neural networks applied to hefty natural language tasks.☆15Dec 26, 2019Updated 6 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70May 8, 2026Updated 2 weeks ago
- Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet pow…☆73Mar 1, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Coqui STT (🐸STT) based forced alignment tool☆13Feb 24, 2022Updated 4 years ago
- ☆13Nov 16, 2022Updated 3 years ago
- Wiktionary parser tool for many language editions.☆54Aug 17, 2022Updated 3 years ago
- Barista is an open-source framework for concurrent speech processing.☆36Mar 19, 2014Updated 12 years ago
- ☆28Jan 29, 2021Updated 5 years ago
- X (weighted / probabilistic) Context-Free Grammars☆25Jan 30, 2024Updated 2 years ago
- Semantic dependency relationship extractor untuk bahasa Indonesia... termasuk bahasa gaul dan alay ;) (terinspirasi oleh OpenCog RelEx)☆10Oct 2, 2015Updated 10 years ago