graehl / carmelLinks
finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
☆41Updated 3 years ago
Alternatives and similar repositories for carmel
Users that are interested in carmel are comparing it to the libraries listed below
Sorting:
- Fast Word Clustering Software☆79Updated 11 months ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 6 years ago
- A Combinatory Categorial Grammar library.☆22Updated 12 years ago
- pronunciation LEXicons for Any Low-resource Language☆21Updated 5 years ago
- bin files☆13Updated 11 months ago
- ☆44Updated 10 years ago
- Expected edit distance implementation using OpenFst tools☆11Updated 10 years ago
- Parsito: Fast non-projective transition-based dependency parser☆14Updated last month
- Corpus preprocessing☆99Updated last year
- Discontinuous Data-Oriented Parsing☆46Updated 2 years ago
- Code for morphological transformations☆29Updated 8 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 9 years ago
- ☆21Updated 10 years ago
- Search back-end for dependency tree search. See the docs at https://fginter.github.io/dep_search/☆17Updated 7 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆32Updated 8 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆200Updated 5 years ago
- Thot toolkit for statistical machine translation☆53Updated 3 years ago
- bilingual dictionary extractor from parallel corpora☆23Updated 11 years ago
- English web corpus with 4M tokens and several annotation types☆26Updated 2 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70Updated last month
- The Kyoyo Language Modeling Toolkit☆27Updated 11 years ago
- Fast and robust NLP components implemented in Java.☆53Updated 5 years ago
- Python framework for processing Universal Dependencies data☆59Updated this week
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 8 years ago
- Excitement Open Platform for Recognizing Textual Entailments☆89Updated 8 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆42Updated 4 months ago
- A tool for text normalisation via character-level machine translation☆13Updated 5 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Updated 5 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Updated 2 years ago