graehl / carmel
finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
☆41 Updated 2 years ago
Alternatives and similar repositories for carmel:
Users who are interested in carmel compare it to the libraries listed below.
- bin files ☆13 Updated 2 weeks ago
- A Combinatory Categorial Grammar library. ☆22 Updated 11 years ago
- Open-source tools for morphological tagging, segmentation and stemming. ☆41 Updated 5 years ago
- Hierarchical phrase-based machine translation system ☆32 Updated 10 years ago
- ☆21 Updated 9 years ago
- Fast Word Clustering Software ☆78 Updated last week
- Utilities for manipulating finite state transducers with the OpenFst library. ☆30 Updated 7 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆12 Updated last year
- Code for morphological transformations ☆29 Updated 7 years ago
- ☆43 Updated 9 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation ☆41 Updated 8 years ago
- Expected edit distance implementation using OpenFst tools ☆11 Updated 9 years ago
- Parsito: Fast non-projective transition-based dependency parser ☆14 Updated 2 years ago
- Code for the paper 'Weighting Finite State Transductions with Neural Context', Pushpendre Rastogi, Ryan Cotterell, Jason Eisner ☆27 Updated 5 years ago
- A Python interface to OpenFst ☆89 Updated 5 years ago
- Search back-end for dependency tree search. See the docs at https://fginter.github.io/dep_search/ ☆17 Updated 6 years ago
- Read-only unofficial mirror of Pynini ☆17 Updated 5 years ago
- Simple, standalone python classes for training statistical language models using several popular smoothing methods. ☆25 Updated 12 years ago
- Command-line corpus tools ☆9 Updated 7 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me… ☆40 Updated 5 months ago
- Deep learning model of machine translation using attentional and structural biases ☆13 Updated 7 years ago
- OxLM: Oxford Neural Language Modelling Toolkit ☆38 Updated 9 years ago
- Automatically exported from code.google.com/p/deepsyntacticparsing ☆23 Updated 9 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?" ☆21 Updated 4 years ago
- English web corpus with 4M tokens and several annotation types ☆26 Updated last year
- Barista is an open-source framework for concurrent speech processing. ☆36 Updated 10 years ago
- Thot toolkit for statistical machine translation ☆50 Updated 2 years ago
- Corpus preprocessing ☆95 Updated 11 months ago
- Simple CORPORA list crawler ☆10 Updated 8 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary ☆11 Updated last year