sigpwned / popular-names-by-country-dataset
A dataset of popular forenames and surnames by country
☆19 Updated last year
Related projects
Alternatives and complementary repositories for popular-names-by-country-dataset
- Script and sample dataset of all urban dictionary entry names (around 1.4 million total) ☆83 Updated 2 years ago
- The Python library for names. ☆843 Updated last month
- Text databases of last names from various countries ☆277 Updated last year
- Wayback Machine API interface & a command-line tool ☆478 Updated 8 months ago
- Word lists from the web. ☆83 Updated 8 years ago
- A CSV file with US given names (first name) and their associated nicknames or diminutive names. ☆290 Updated last week
- Download data on all of Donald Trump's (@realDonaldTrump) tweets ☆41 Updated 6 years ago
- English Lemma Database - Compiled by Referencing British National Corpus ☆29 Updated last month
- Full list of US states and cities ☆269 Updated 3 months ago
- Reference implementation for measuring linguistic cultural distances between individuals and groups. ☆14 Updated 5 years ago
- A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆422 Updated 8 months ago
- A simple fuzzy matching set for python strings ☆223 Updated 2 months ago
- Python client for RealClearPolitics. ☆28 Updated 2 years ago
- Predict Race and Ethnicity Based on the Sequence of Characters in a Name ☆234 Updated 5 months ago
- Zest Race Predictor ☆28 Updated last month
- Searching for misspelling, bad grammar, and violations of the Manual of Style in Wikipedia ☆13 Updated last month
- Python library and command line tool for collecting JSON data from Gab.ai. Scrape posts, users and comments from "free-speech" social med… ☆35 Updated 2 years ago
- ☆121 Updated last year
- Fortune 500 company lists since 1955 in CSV format, mostly parsed using Beautiful Soup ☆86 Updated 3 years ago
- Machine-readable lists of lemma-token pairs in 23 languages. ☆332 Updated 2 years ago
- A Python scraper for Goodreads books and reviews. ☆274 Updated 5 months ago
- Offline database of synonyms/thesaurus ☆189 Updated 9 months ago
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆109 Updated 8 months ago
- The largest English-language thesaurus ☆278 Updated last year
- 📛 Fuzzy Name Matching with Machine Learning ☆257 Updated 4 months ago
- Bot that determines if a post in a circlejerk or parody subreddit has a relevant post in its original subreddit, and links it in the comm… ☆58 Updated 3 months ago
- A package for easily working with US and state metadata ☆481 Updated 3 months ago
- All the words from Google Books, sorted by frequency ☆109 Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆365 Updated last year
- Python script to scrape www.goodreads.com book shelves. ☆14 Updated 9 months ago