zseder / hundictLinks
bilingual dictionary extractor from parallel corpora
☆22Updated 11 years ago
Alternatives and similar repositories for hundict
Users that are interested in hundict are comparing it to the libraries listed below
Sorting:
- UniParse: A universal graph-based parsing toolkit☆10Updated 6 years ago
- ☆23Updated 8 years ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- ☆44Updated 10 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 6 years ago
- This dataset contains naturally-occurring English sentences that feature non-trivial noun-verb ambiguity.☆36Updated 6 years ago
- Corpus preprocessing☆99Updated last year
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 3 years ago
- ☆21Updated 8 years ago
- Democratizing NLP!☆105Updated last year
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Updated 10 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35Updated 2 years ago
- Thot toolkit for statistical machine translation☆53Updated 3 years ago
- Excitement Open Platform for Recognizing Textual Entailments☆88Updated 8 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆79Updated 2 years ago
- Transition-based UCCA Parser☆73Updated 4 years ago
- C++ code of "Learning to Parse and Translate Improves Neural Machine Translation"☆21Updated 8 years ago
- ☆47Updated 8 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies☆69Updated 6 years ago
- Code for morphological transformations☆29Updated 8 years ago
- A temporal ordering system for events and time expressions in written text.☆42Updated 3 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 6 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- Efficient Markov Chain word alignment☆52Updated 4 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆38Updated 3 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 4 years ago
- A database of number names for 186 languages, locales, and scripts☆67Updated 2 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 9 months ago
- MARMOT - the open source framework for feature extraction and machine learning, designed to estimate the quality of Machine Translation o…☆22Updated 8 years ago
- An extremely simple Python wrapper for the SRI Language Modeling toolkit☆70Updated 11 years ago