kiasar / Dictionary_crawlerLinks
This is a python code based on Scrapy package to crawl famous online dictionaries like Oxford, Longman, Cambridge, Webster, and Collins to make a dataset
☆106Updated 2 years ago
Alternatives and similar repositories for Dictionary_crawler
Users that are interested in Dictionary_crawler are comparing it to the libraries listed below
Sorting:
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 6 years ago
- This is project convert The Online Plain Text English Dictionary (OPTED) to SQLite database and JSON files☆87Updated 4 years ago
- A Python Wiktionary Parser☆360Updated 3 months ago
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆124Updated last year
- Extract dates from text☆64Updated 4 years ago
- api to retrieve word definitions and other info☆66Updated 2 years ago
- Fifteen Thousand Useful Phrases, by Greenville Kleiser☆54Updated 9 years ago
- Gather modern English word frequencies from all enwiki articles.☆213Updated last year
- Verb forms dictionary☆66Updated 7 years ago
- German part-of-speech dictionary☆45Updated last year
- A cloud-based, open-source system for writing and publishing dictionaries.☆91Updated last year
- PyDictionary is a Dictionary Module for Python 2/3 to get meanings, translations, synonyms and antonyms of words☆280Updated last year
- Linguistic search for large annotated text corpora, based on Apache Lucene☆112Updated this week
- Offline bilingual dictionaries made using data from Wiktionary☆55Updated 10 years ago
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python☆110Updated last week
- Lexical database for ~70k English words with morphological variables☆44Updated 3 years ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code☆35Updated 3 months ago
- Extract and align grammar patterns from English sentences.☆55Updated 2 years ago
- ☆26Updated 2 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆246Updated 2 years ago
- A list of vocabulary lists☆21Updated 4 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆101Updated 2 weeks ago
- a CSV of every english word, part of speech, and definition. as well as a web scraping script that generates that data for you☆117Updated 2 years ago
- Tools for creating DSL-format dictionaries☆15Updated 3 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆12Updated last year
- A python module for English lemmatization and inflection.☆268Updated last year
- Extract data from Octopus mdict (*.mdd, *.mdx) files☆23Updated 7 years ago
- Open Language Profiles — English profile datasets from CEFR-J☆126Updated 5 years ago
- a python package for cleaning Gutenberg books and dataset☆34Updated last month
- This packages up data for the Open Multilingual Wordnet☆49Updated this week