sigpwned / popular-names-by-country-dataset
A dataset of popular forenames and surnames by country
☆32Updated last year
Alternatives and similar repositories for popular-names-by-country-dataset:
Users that are interested in popular-names-by-country-dataset are comparing it to the libraries listed below
- Hate speech dataset from Stormfront forum manually labelled at sentence level.☆171Updated 4 years ago
- A dataset of multinational first names and last names☆26Updated last year
- Pushshift Telegram Ingest☆86Updated 5 years ago
- Word lists from the web.☆87Updated 8 years ago
- A list of over 5000 US news domains and their social media accounts☆44Updated 2 years ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆217Updated 2 years ago
- Given a set of URLs, this packages detects coordinated link sharing behavior on social media and outputs the network of entities that per…☆75Updated 8 months ago
- A helper library full of URL-related heuristics.☆69Updated last month
- 🌬️urlExpander is a Python package for expanding shortened links (urls).☆73Updated 2 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- ☆79Updated last year
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆129Updated last year
- Code used for collecting and saving the Top Stories and Trending Stories in Apple News via Appium.☆17Updated 4 years ago
- Mecodify tool for twitter data analysis and visualisation☆42Updated last year
- How are words loaded with meaning? Repository accompanying research by Alina Arseniev-Koehler and Jacob G. Foster, titled "Machine learn…☆41Updated last year
- track changes to the news, where news is anything with an RSS feed☆178Updated 4 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆150Updated 3 months ago
- UNOFFICIAL Python API to interface with Parler.com☆53Updated 9 months ago
- Example scripts for the pushshift dump files☆357Updated 2 weeks ago
- A reddit bot that finds original publish dates on linked articles.☆10Updated 4 months ago
- A machine readable JSON QAnon dataset, archiving all QAnon drops for research only☆25Updated this week
- datamining the parler datadump☆15Updated 4 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆177Updated 6 months ago
- All languages stopwords collection☆439Updated last year
- A script to download all the available tweets from a Twitter user☆42Updated 2 years ago
- Backend component for Hoaxy, a tool to visualize the spread of claims and fact checking☆140Updated 2 years ago
- Unreliable News Index (for Columbia Journalism Review)☆56Updated 3 years ago
- Cleans Reddit Text Data☆81Updated 5 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago