LGDoor / Dump-of-Simple-English-WikiLinks
☆17Updated 12 years ago
Alternatives and similar repositories for Dump-of-Simple-English-Wiki
Users that are interested in Dump-of-Simple-English-Wiki are comparing it to the libraries listed below
Sorting:
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated 3 weeks ago
- The Open English WordNet☆677Updated last week
- A JavaScript implementation of Douglas Hofstadter and Melanie Mitchell's Copycat program.☆31Updated 4 months ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary.☆49Updated 2 years ago
- English Resource Grammar☆24Updated last month
- Gather modern English word frequencies from all enwiki articles.☆227Updated last year
- The largest English-language thesaurus☆309Updated 2 months ago
- universal tokenizer☆16Updated 4 years ago
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆139Updated last year
- Word embeddings for the web☆28Updated 2 years ago
- Plot suggestions for writers of creative fiction☆141Updated 2 years ago
- Jupyter notebooks for course "Computational Morphology with HFST".☆19Updated 3 years ago
- tool for collectively summarizing large discussions☆145Updated 3 years ago
- Neural network poetry rewriter☆21Updated 3 years ago
- A Python Wiktionary Parser☆367Updated 4 months ago
- A seriously hacky editor for text.☆36Updated 5 years ago
- Machine-readable Wiktionary☆77Updated last year
- Transliteration models for 21 Indic languages☆105Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆31Updated 5 months ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆353Updated 3 years ago
- Toki Pona corpus for NLTK☆15Updated 6 years ago
- A modern, interlingual wordnet interface for Python☆276Updated last week
- browse wikipedia a la andy matuschak's evergreen notes☆29Updated last year
- WordNet in JSON format.☆96Updated 5 years ago
- Main application code for Ambuda, a breakthrough Sanskrit library (ambuda.org)☆105Updated last week
- Aksharamukha Python Library☆55Updated 10 months ago
- A tiny, self-contained JavaScript wiki that runs in the browser [MIRROR]☆139Updated last month
- A CLI for Mozilla Readability. Get clean, uncluttered, ready-to-read HTML from any webpage!☆51Updated 2 years ago
- The World Atlas of Language Structures☆72Updated last year
- downloads and parses subtitle dataset from opensubtitles.org☆16Updated last year