ybourque / Wikparser
Wiktionary Parser
☆28 Updated 7 years ago
Related projects
Alternatives and complementary repositories for Wikparser
- A parser and autocorrection tool for Wiktionary. ☆39 Updated 8 years ago
- Bilingual sentence aligner (Gale & Church, 1993) ☆14 Updated 5 years ago
- Hierarchical phrase-based machine translation system ☆32 Updated 9 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆65 Updated this week
- The Community-enRiched Open WordNet (CROWN) ☆19 Updated 8 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆185 Updated 4 years ago
- Morpha lex stemmer converted using jflex. ☆23 Updated 4 years ago
- Wiktionary parser tool for many language editions. ☆53 Updated 2 years ago
- Basic dataset for the linguistic data collection. ☆15 Updated 7 years ago
- Dependency parse tree visualization with D3 library. ☆42 Updated last year
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆61 Updated 6 months ago
- A fast, simple, multilingual tokenizer ☆28 Updated 7 years ago
- PurePos is an open source hybrid morphological tagger. ☆15 Updated 4 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆110 Updated 4 months ago
- Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word Expressions / collocations ☆15 Updated 7 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?) ☆36 Updated 10 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year
- Code for morphological transformations ☆29 Updated 7 years ago
- Fast Word Clustering Software ☆74 Updated 3 months ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet… ☆29 Updated 2 months ago
- Open-source tools for morphological tagging, segmentation and stemming. ☆41 Updated 5 years ago
- Command-line corpus tools ☆9 Updated 7 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg… ☆124 Updated this week
- Treex NLP framework ☆33 Updated this week
- Transliteration package for Indian scripts ☆16 Updated 7 years ago
- An Enhanced Lesk Word Sense Disambiguation Algorithm through a Distributional Semantic Model ☆23 Updated 7 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!". ☆63 Updated 9 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index ☆40 Updated 2 weeks ago
- Parses Polish Wiktionary and creates simple dictionaries of foreign languages (e.g. English) to Polish and vice versa. ☆16 Updated 11 years ago