kiasar / Dictionary_crawler
This is a python code based on Scrapy package to crawl famous online dictionaries like Oxford, Longman, Cambridge, Webster, and Collins to make a dataset
☆105Updated last year
Alternatives and similar repositories for Dictionary_crawler:
Users that are interested in Dictionary_crawler are comparing it to the libraries listed below
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 6 years ago
- This is project convert The Online Plain Text English Dictionary (OPTED) to SQLite database and JSON files☆87Updated 4 years ago
- Gather modern English word frequencies from all enwiki articles.☆212Updated last year
- PyDictionary is a Dictionary Module for Python 2/3 to get meanings, translations, synonyms and antonyms of words☆280Updated last year
- 30,000 most common English words with Chinese dictionary explanations in order of frequency.☆186Updated 5 years ago
- Offline database of synonyms/thesaurus☆195Updated last year
- Offline bilingual dictionaries made using data from Wiktionary☆54Updated 10 years ago
- A Python Wiktionary Parser☆359Updated 2 months ago
- This is an SQL file of Oxford English Dictionary. It includes more than 41,OOO words! Just import the SQL.☆48Updated 10 years ago
- A python module for English lemmatization and inflection.☆268Updated last year
- A modern, interlingual wordnet interface for Python☆244Updated this week
- api to retrieve word definitions and other info☆65Updated 2 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆245Updated 2 years ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆34Updated 3 months ago
- Extract and align grammar patterns from English sentences.☆54Updated 2 years ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆123Updated 11 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆99Updated last week
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Lexical database of any language☆179Updated 2 years ago
- Stand-alone WordNet API☆48Updated 3 years ago
- Fifteen Thousand Useful Phrases, by Greenville Kleiser☆54Updated 8 years ago
- Break long English Sentence into simple sentences☆14Updated last year
- A list of the most popular English words.☆372Updated 2 years ago
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆93Updated 11 months ago
- Improved Sentence Alignment in Linear Time and Space☆171Updated 2 years ago
- PyMultiDictionary is a dictionary module that gets meanings, translations, synonyms, and antonyms of words in 20 different languages☆50Updated 2 weeks ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- SCOWL (and friends).☆419Updated 3 weeks ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆66Updated last year
- The Open English WordNet☆546Updated last week