benreynwar / wiktionary-parser
A parser and autocorrection tool for wiktionary.
☆39 Updated 9 years ago
Alternatives and similar repositories for wiktionary-parser:
Users who are interested in wiktionary-parser are comparing it to the libraries listed below.
- Wiktionary parser tool for many language editions. ☆53 Updated 2 years ago
- Hierarchical phrase-based machine translation system ☆33 Updated 10 years ago
- Machine-readable Wiktionary ☆74 Updated 8 months ago
- NLTK Contrib ☆166 Updated 10 months ago
- Java Wiktionary Library ☆57 Updated 2 years ago
- Software and resources for natural language processing. ☆131 Updated 8 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation. ☆21 Updated 7 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆66 Updated last month
- Excitement Open Platform for Recognizing Textual Entailments ☆86 Updated 7 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary ☆11 Updated last year
- The Zurich Dependency Parser for German ☆82 Updated 2 years ago
- Command-line corpus tools ☆9 Updated 7 years ago
- Wiktionary Parser ☆28 Updated 7 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid. ☆76 Updated 3 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet… ☆29 Updated last month
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!". ☆63 Updated 9 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm, e.g. "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived'] ☆82 Updated 8 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆12 Updated last year
- NLP tools developed by Emory University. ☆60 Updated 8 years ago
- Joshua Statistical Machine Translation Toolkit ☆122 Updated 8 years ago
- Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) contex… ☆183 Updated 4 years ago
- Recipes for training OpenNMT systems ☆14 Updated 7 years ago
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/ ☆50 Updated 5 years ago
- Code for morphological transformations ☆29 Updated 7 years ago
- Semanticizest: dump parser and client ☆20 Updated 8 years ago
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface) ☆33 Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆61 Updated 8 months ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob. ☆102 Updated 9 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year