kiasar / Dictionary_crawler
Python code based on the Scrapy package that crawls well-known online dictionaries such as Oxford, Longman, Cambridge, Webster, and Collins to build a dataset
☆102 Updated last year
Alternatives and similar repositories for Dictionary_crawler:
Users who are interested in Dictionary_crawler are comparing it to the libraries listed below.
- Gather modern English word frequencies from all enwiki articles. ☆212 Updated last year
- A Python Wiktionary Parser ☆357 Updated last month
- api to retrieve word definitions and other info ☆64 Updated 2 years ago
- ☆26 Updated 2 years ago
- Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code ☆32 Updated last month
- This project converts The Online Plain Text English Dictionary (OPTED) to an SQLite database and JSON files ☆86 Updated 4 years ago
- Extract and align grammar patterns from English sentences. ☆54 Updated 2 years ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text. ☆23 Updated 4 years ago
- Fifteen Thousand Useful Phrases, by Greenville Kleiser ☆54 Updated 8 years ago
- Offline bilingual dictionaries made using data from Wiktionary ☆53 Updated 9 years ago
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896 ☆43 Updated 6 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆243 Updated 2 years ago
- British English pronunciation dictionary ☆93 Updated 7 years ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆23 Updated 7 years ago
- This is an SQL file of the Oxford English Dictionary. It includes more than 41,000 words! Just import the SQL. ☆47 Updated 10 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 Updated 3 years ago
- A modern, interlingual wordnet interface for Python ☆236 Updated last week
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech… ☆72 Updated 3 months ago
- Arabic Transliteration in Python ☆36 Updated 11 years ago
- Converts English text to IPA notation ☆380 Updated last year
- Stand-alone WordNet API ☆48 Updated 3 years ago
- Crawler for linguistic corpora ☆205 Updated last year
- A simple phonetic respelling for the English language ☆10 Updated 2 years ago
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking. ☆51 Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆97 Updated 3 weeks ago
- Translate HTML using Argos Translate ☆50 Updated last year
- linguistics backend ☆41 Updated 2 years ago
- A multilingual parallel corpus created from translations of the Bible. ☆178 Updated 6 months ago
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard ☆45 Updated 2 years ago
- Editor for aligned parallel texts (personal desktop application). ☆19 Updated 4 years ago