sigpwned / popular-names-by-country-dataset
A dataset of popular forenames and surnames by country
☆32Updated last year
Alternatives and similar repositories for popular-names-by-country-dataset:
Users that are interested in popular-names-by-country-dataset are comparing it to the libraries listed below
- Datasets of the daily Twitter output of Congress.☆108Updated last year
- Pushshift Telegram Ingest☆86Updated 5 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆53Updated 3 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆220Updated last year
- roll a wikipedia dump into mongo☆242Updated 9 months ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated last year
- The Python library for names.☆887Updated last month
- Analyzing words Redditors aren't sure how to spell☆49Updated 6 years ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- The documentation and scripts for the Local News Dataset☆25Updated 2 years ago
- Reference datasets on historic and current names in the US☆46Updated 10 years ago
- Political Discourse Analysis (PDA) of Political Speech Transcripts using Natural Language Processing (NLP)☆16Updated 3 years ago
- An open interface to GDELT APIs☆46Updated last year
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- Mecodify tool for twitter data analysis and visualisation☆42Updated last year
- Poetic processing, for Python.☆40Updated 11 months ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆107Updated 6 years ago
- A small command line tool and set of functions for studying coordination networks in Twitter and other social media data.☆76Updated 2 years ago
- UNOFFICIAL Python API to interface with Parler.com☆53Updated 8 months ago
- A list of over 5000 US news domains and their social media accounts☆44Updated 2 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆149Updated 2 months ago
- The largest English-language thesaurus☆289Updated 2 years ago
- An Ancient Greek Morphology Tagger☆26Updated last year
- A dataset of multinational first names and last names☆26Updated last year
- ☆25Updated 5 years ago
- TweetedAt tells the time of a tweet based on its tweet id☆47Updated 4 years ago
- A Python Twitter bot posting recently active questions from Stack Overflow. Tweaked to run on AWS Lambda.☆10Updated 5 years ago
- A Server Side Event stream to deliver Reddit comments and submissions in near real-time to a client.