benreynwar / wiktionary-parser
A parser and autocorrection tool for wiktionary.
☆39Updated 8 years ago
Related projects: ⓘ
- Hierarchical phrase-based machine translation system☆31Updated 9 years ago
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 7 years ago
- Recipes for training OpenNMT systems☆14Updated 7 years ago
- Machine translation for the real world☆23Updated 4 years ago
- Java Wiktionary Library☆57Updated last year
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆29Updated last week
- Thot toolkit for statistical machine translation☆50Updated last year
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆180Updated 3 years ago
- The Language Learning Toolkit (LLTK) performs a variety of tasks useful for (human) language learning.☆41Updated 4 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆60Updated 4 months ago
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 6 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆11Updated last year
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆11Updated last year
- Command-line corpus tools☆9Updated 7 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆104Updated 8 years ago
- Bilingual sentence aligner (Gale & Church, 1993)☆14Updated 5 years ago
- Fast Word Clustering Software☆74Updated last month
- http://www.ark.cs.cmu.edu/ARKref/☆32Updated 10 years ago
- Machine-readable Wiktionary☆74Updated 4 months ago
- Translation of query languages to serialized KoralQuery protocol☆10Updated last week
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆44Updated 3 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- NLTK Contrib☆166Updated 6 months ago
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface)☆33Updated 2 years ago
- Joshua Statistical Machine Translation Toolkit☆121Updated 8 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 3 years ago
- My implementation of Explicit Semantic Analysis (ESA) library that we used at KMi, Open University to produce our submission at the NTCIR…☆36Updated 8 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆41Updated 5 years ago
- ☆12Updated this week