zaibacu / thesaurus
Offline database of synonyms/thesaurus
☆191Updated last year
Alternatives and similar repositories for thesaurus:
Users that are interested in thesaurus are comparing it to the libraries listed below
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆97Updated this week
- A modern, interlingual wordnet interface for Python☆233Updated last week
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆238Updated 2 years ago
- WordNet in JSON format.☆91Updated 4 years ago
- English Lemma Database - Compiled by Referencing British National Corpus☆29Updated 4 months ago
- Verb forms dictionary☆63Updated 7 years ago
- A list of vocabulary lists☆21Updated 4 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆33Updated last year
- A Python Wiktionary Parser☆358Updated last year
- WordNet to neo4j 2.2☆12Updated 9 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆335Updated 3 years ago
- The Open English WordNet☆505Updated 3 weeks ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- The Open Multilingual Wordnet☆61Updated 9 months ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆64Updated this week
- A Python library to parse MediaWiki WikiText☆299Updated 3 months ago
- Gather modern English word frequencies from all enwiki articles.☆209Updated 11 months ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆98Updated 4 months ago
- A python module for English lemmatization and inflection.☆265Updated last year
- TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted…☆246Updated 9 years ago
- Targetted language identifier, based on FastText and Hunspell.☆33Updated this week
- Crawler for linguistic corpora☆199Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 2 years ago
- A character-wise tokenizer for morphologically rich languages☆27Updated last month
- Sentence aligner☆108Updated 3 years ago
- roll a wikipedia dump into mongo☆241Updated 7 months ago
- Offline bilingual dictionaries made using data from Wiktionary☆52Updated 9 years ago
- Stand-alone WordNet API☆48Updated 2 years ago
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 6 years ago
- This is a python code based on Scrapy package to crawl famous online dictionaries like Oxford, Longman, Cambridge, Webster, and Collins t…☆103Updated last year