componavt / wikokit
Machine-readable Wiktionary
☆75Updated 9 months ago
Alternatives and similar repositories for wikokit:
Users that are interested in wikokit are comparing it to the libraries listed below
- Java Wiktionary Library☆57Updated 2 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆41Updated 3 months ago
- German part-of-speech dictionary☆43Updated last year
- A parser and autocorrection tool for wiktionary.☆39Updated 9 years ago
- Fast corpus search engine originally made for the Corpus of Written Tatar language☆16Updated 5 years ago
- A Python Wiktionary Parser☆357Updated last year
- Helsinki Finite-State Technology (library and application suite)☆128Updated 3 weeks ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆97Updated this week
- Sentence aligner☆109Updated 3 years ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆74Updated 2 weeks ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆89Updated last year
- CRF-based Morphological Tagging and Lemmatization☆36Updated 5 years ago
- A collection of tools for reading/processing the multilingual Bible corpus☆15Updated 2 years ago
- Imports Wiktionary's grammatical data into Wikidata☆17Updated 5 years ago
- A multilingual parallel corpus created from translations of the Bible.☆177Updated 5 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Open morphology for Finnish☆87Updated last month
- Morphological Dictionaries for German Language☆28Updated 6 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆189Updated 4 years ago
- Gather modern English word frequencies from all enwiki articles.☆211Updated 11 months ago
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/☆51Updated 5 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated this week
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆73Updated 2 months ago
- WordNet-LMF formats☆21Updated last week
- A list of vocabulary lists☆21Updated 4 years ago
- The curation repository for the data behind Concepticon.☆37Updated this week
- eXtensible Interlinear Glossed Text☆32Updated 2 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆46Updated 3 months ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated last year