benreynwar / wiktionary-parser
A parser and autocorrection tool for Wiktionary.
☆39 Updated 9 years ago
Alternatives and similar repositories for wiktionary-parser
Users who are interested in wiktionary-parser are comparing it to the libraries listed below.
- Wiktionary parser tool for many language editions. ☆54 Updated 2 years ago
- Hierarchical phrase-based machine translation system ☆32 Updated 10 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary ☆11 Updated last year
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!". ☆63 Updated 9 years ago
- Thot toolkit for statistical machine translation ☆53 Updated 2 years ago
- Simple CORPORA list crawler ☆10 Updated 8 years ago
- Fast Word Clustering Software ☆78 Updated 3 months ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet… ☆29 Updated 5 months ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived'] ☆81 Updated 9 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob. ☆102 Updated 9 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆12 Updated last year
- Bilingual sentence aligner (Gale & Church, 1993) ☆14 Updated 6 years ago
- UIMA-based text classification framework built on top of DKPro Core and DKPro Lab. ☆34 Updated 2 years ago
- GermaNet API for Python ☆53 Updated 7 years ago
- Fast and robust NLP components implemented in Java. ☆52 Updated 4 years ago
- stav text annotation visualiser ☆34 Updated 13 years ago
- Annodoc annotation documentation support system ☆34 Updated 4 years ago
- A Utility Library for Wikipedia dumps ☆33 Updated 8 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 Updated 3 years ago
- Machine-readable Wiktionary ☆76 Updated last year
- Excitement Open Platform for Recognizing Textual Entailments ☆89 Updated 7 years ago
- Recipes for training OpenNMT systems ☆14 Updated 7 years ago
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface) ☆34 Updated 3 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby ☆17 Updated 3 years ago
- Command-line corpus tools ☆9 Updated 8 years ago
- A Recurrent Neural Network trained on all existing TED Talk Transcripts. The model outputs machine generated TED Talks. ☆51 Updated 7 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation. ☆21 Updated 8 years ago
- NLP tools developed by Emory University. ☆60 Updated 8 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆193 Updated 4 years ago
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik… ☆259 Updated 8 years ago