kiasar / Dictionary_crawler
This is a python code based on Scrapy package to crawl famous online dictionaries like Oxford, Longman, Cambridge, Webster, and Collins to make a dataset
☆102Updated last year
Alternatives and similar repositories for Dictionary_crawler:
Users that are interested in Dictionary_crawler are comparing it to the libraries listed below
- This is project convert The Online Plain Text English Dictionary (OPTED) to SQLite database and JSON files☆86Updated 4 years ago
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896☆43Updated 6 years ago
- Fifteen Thousand Useful Phrases, by Greenville Kleiser☆54Updated 8 years ago
- download youtube subtitles(closed caption, cc) as txt or json, support translation and proxy. available on PIP 🐍 . try it online at goo…☆70Updated last year
- PyDictionary is an offline English dictionary made using Python along with the Wordnet Lexical Database and Enchant Spell Dictionary. The…☆17Updated 3 years ago
- Scripts for building a geo-located web corpus using Common Crawl data☆11Updated 2 weeks ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- A Python Wiktionary Parser☆357Updated last month
- Offline bilingual dictionaries made using data from Wiktionary☆53Updated 9 years ago
- This is an SQL file of Oxford English Dictionary. It includes more than 41,OOO words! Just import the SQL.☆47Updated 10 years ago
- This repository provides various Python methods for finding and aggregating synonyms for an individual word or a list of words.☆34Updated 2 years ago
- PyMultiDictionary is a dictionary module that gets meanings, translations, synonyms, and antonyms of words in 20 different languages☆50Updated this week
- This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.☆123Updated 10 months ago
- A python module for English lemmatization and inflection.☆266Updated last year
- Python library for downloading closed captions(subtitles) from Youtube☆61Updated last year
- LF Aligner helps translators create translation memories from texts and their translations. It relies on Hunalign for automatic sentence …☆11Updated 9 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆243Updated 2 years ago
- Text to PDF converter with Unicode support☆74Updated last year
- Crawler for linguistic corpora☆205Updated last year
- convert epub file to txt☆85Updated 4 years ago
- Gather modern English word frequencies from all enwiki articles.☆212Updated last year
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆63Updated last year
- api to retrieve word definitions and other info☆64Updated 2 years ago
- Python wrapper for Wikipedia☆642Updated last week
- Analyse rhyme scheme, metre and form of poems☆130Updated 3 years ago
- 🏆 • 5050 most frequent words in 109 languages☆42Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆154Updated 4 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆97Updated 3 weeks ago
- Google News Scraper for languages like Japanese, Chinese... [VPN Support]☆96Updated 3 years ago
- Lexical database of any language☆178Updated 2 years ago