taikuukaits / SimpleWordlists
Word lists from the web.
☆79 Updated 8 years ago
Related projects:
- Script and sample dataset of all Urban Dictionary entry names (around 1.4 million total) ☆81 Updated 2 years ago
- Lists of the most frequently used English words, nouns, verbs, etc. ☆43 Updated 4 years ago
- Offline database of synonyms/thesaurus ☆184 Updated 7 months ago
- Hide text in plain sight using invisible zero-width characters ☆190 Updated 3 years ago
- Ruby commands for ARIN's Reg-RWS and Whois-RWS ☆45 Updated last year
- Archive a Reddit user's post history. Formatted overview of a profile, JSON containing every post, and picture downloads. Uses the pushs… ☆50 Updated 2 years ago
- Calculate syllable count for English words. ☆32 Updated 6 months ago
- Words categorized by topic. ☆294 Updated last year
- A very long list of English profanity. ☆234 Updated last year
- List of English synonyms and antonyms parsed from the public domain book of James C. Fernald, 1896 ☆43 Updated 5 years ago
- Probably the most advanced command-line English dictionary ever. ☆37 Updated 4 years ago
- 📦 A list, huge one (~200K) of human male/female first/last names. ☆37 Updated 10 months ago
- Scrape, Hunt, and Transform names and usernames ☆104 Updated last year
- Integrated web scraper and email account data breach comparison tool ☆74 Updated last month
- Python framework to scrape Pastebin pastes and analyze them ☆122 Updated last year
- A definitive guide to generating usernames for OSINT purposes ☆144 Updated 3 months ago
- A tool to extract useful data from documents ☆154 Updated 3 years ago
- Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group. ☆78 Updated 3 years ago
- Grammarify is an npm package that safely cleans up text that has misspellings, improper capitalization, lexical illusions, among other thin… ☆66 Updated last year
- ☆73 Updated this week
- A Reddit bot that finds original publish dates on linked articles. ☆10 Updated 2 months ago
- An authorship attribution project with particular emphasis on Twitter analysis ☆16 Updated 2 years ago
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆109 Updated 7 months ago
- A scraper written in Python to scrape the public Pastebin archive and filter with customizable and extensible YARA rules ☆42 Updated 4 months ago
- Public API client for GETTR, a "non-bias [sic] social network," designed for data archival and analysis. ☆89 Updated 2 months ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service ☆167 Updated 2 weeks ago
- Find the origin of words in every language using a Deep Neural Network trained to create an etymological map. ☆20 Updated 6 years ago
- A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning tech… ☆69 Updated last week
- Convert SQL dumps and other leaked database dump formats to CSV ☆43 Updated 4 months ago
- A minimum-dependency ECMAScript client library and CLI tool for Parler – a "free speech" social network that accepts real money to buy "i… ☆68 Updated 3 months ago