benreynwar / wiktionary-parser
A parser and autocorrection tool for wiktionary.
☆39Updated 9 years ago
Alternatives and similar repositories for wiktionary-parser
Users that are interested in wiktionary-parser are comparing it to the libraries listed below
Sorting:
- Hierarchical phrase-based machine translation system☆32Updated 10 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Thot toolkit for statistical machine translation☆53Updated 2 years ago
- A Utility Library for Wikipedia dumps☆33Updated 8 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆29Updated 4 months ago
- Java Wiktionary Library☆57Updated 2 years ago
- Command-line corpus tools☆9Updated 8 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated 3 months ago
- Supervised learning of morphology☆28Updated 8 years ago
- NLTK Contrib☆166Updated last year
- UIMA-based text classification framework built on top of DKPro Core and DKPro Lab.☆34Updated 2 years ago
- Scrapes some Finnish word definitions from English Wiktionary.☆8Updated last year
- Recipes for training OpenNMT systems☆14Updated 7 years ago
- Machine translation for the real world☆23Updated 5 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary☆11Updated last year
- The WikiBrain Java library enables researchers and developers to incorporate state-of-the-art Wikipedia-based algorithms and technologies…☆93Updated 6 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆102Updated 9 years ago
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 6 years ago
- Wiktionary Parser☆28Updated 8 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated last year
- Word and text similarity measures☆54Updated 2 years ago
- English Dependency Relationship Extractor☆85Updated 4 months ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- A toolkit for corpus linguistics☆205Updated 5 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby☆17Updated 2 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- Excitement Open Platform for Recognizing Textual Entailments☆89Updated 7 years ago
- Simple CORPORA list crawler☆10Updated 8 years ago
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆49Updated last month