taikuukaits / SimpleWordlists
Word lists from the web.
☆87Updated 8 years ago
Alternatives and similar repositories for SimpleWordlists:
Users that are interested in SimpleWordlists are comparing it to the libraries listed below
- Lists of most-frequently-used english words / nouns / verbs etc.☆60Updated 4 years ago
- Offline database of synonyms/thesaurus☆193Updated last year
- Organised collection of common file extensions☆153Updated 10 months ago
- 📦 A list, huge one (~200K) of human male/female first/last names.☆48Updated last year
- Extract text from HTML☆134Updated 4 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech…☆75Updated 6 months ago
- Scraper for downloading the entire ebooks repository of project Gutenberg☆144Updated this week
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆177Updated 5 months ago
- Letterpress Word List☆405Updated 9 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- A very long list of English profanity.☆254Updated 3 months ago
- Script and sample dataset of all urban dictionary entry names (around 1.4 million total)☆88Updated 2 years ago
- A Python API to the Internet Archive Wayback Machine☆71Updated 7 months ago
- Lightweight package to query popular search engines and scrape for result titles, links and descriptions☆472Updated 10 months ago
- Probably the most advanced command-line english dictionary ever.☆38Updated 5 years ago
- A Python tool for downloading videos from vk.com☆21Updated 5 years ago
- A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and…☆19Updated 2 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆97Updated 3 weeks ago
- The largest English-language thesaurus☆289Updated 2 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆243Updated 2 years ago
- Streaming WARC/ARC library for fast web archive IO☆408Updated 3 months ago
- Yet another googlesearch - A Python library for executing intelligent, realistic-looking, and tunable Google searches.☆273Updated 11 months ago
- English Lemma Database - Compiled by Referencing British National Corpus☆30Updated 6 months ago
- Scrapes Google Books Ngram data to create a long word list☆13Updated last year
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆11Updated 11 months ago
- Crawler for linguistic corpora☆205Updated last year
- Tool for extracting comments or subtitles from youtube video's☆142Updated 3 years ago
- URLTeam's second generation of URL shortener archiving tools☆75Updated 2 months ago
- Ultimate Website Sitemap Parser☆197Updated 2 weeks ago