sigpwned / popular-names-by-country-datasetLinks
A dataset of popular forenames and surnames by country
☆52Updated 2 years ago
Alternatives and similar repositories for popular-names-by-country-dataset
Users that are interested in popular-names-by-country-dataset are comparing it to the libraries listed below
Sorting:
- The Python library for names.☆964Updated 8 months ago
- Text databases of last names from various countries☆281Updated 3 years ago
- 📦 A list, huge one (~200K) of human male/female first/last names.☆55Updated 2 years ago
- Index Common Crawl archives in tabular format☆124Updated last week
- A helper library full of URL-related heuristics.☆73Updated 2 months ago
- A database of courts, tests and other experiments☆97Updated 2 months ago
- Public API client for GETTR, a "non-bias [sic] social network," designed for data archival and analysis.☆95Updated last week
- Word lists from the web.☆93Updated 9 years ago
- JSON representations of all Supreme Court cases since 1956☆26Updated 6 years ago
- Offline database of synonyms/thesaurus☆206Updated last year
- Now included in rigour☆152Updated 3 weeks ago
- A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/☆205Updated 7 years ago
- A database of court reporters, tests and other experiments☆117Updated 2 weeks ago
- Find legal citations in any block of text☆190Updated 2 months ago
- Tracking the far right on Twitter☆63Updated 2 years ago
- Unreliable News Index (for Columbia Journalism Review)☆56Updated 3 years ago
- A list of over 5000 US news domains and their social media accounts☆48Updated 2 years ago
- A Python library designed for scraping data from the SCP wiki.☆16Updated 5 years ago
- Python library and command line tool for collecting JSON data from Gab.ai. Scrape posts, users and comments from "free-speech" social med…☆38Updated 3 years ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆194Updated 3 weeks ago
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.☆119Updated last year
- Old Twint style, but zero fat.☆281Updated 2 years ago
- Python wrapper for the MediaWiki API to access and parse data from Wikipedia☆42Updated last week
- A tool to detect whether a PDF has a bad redaction☆159Updated last week
- JSON file of all games available on Steam with prices and additional data from Steam Spy, GameFAQs, Metacritic, IGDB and HLTB.☆93Updated 2 years ago
- Estimating the age of web resources☆97Updated 6 months ago
- A machine readable JSON QAnon dataset, archiving all QAnon drops for research only☆30Updated last week
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆130Updated last month
- A dataset of multinational first names and last names☆27Updated 2 years ago
- English stopwords collection☆166Updated 9 years ago