jabraunlin / reddit-user-id
☆79Updated 6 years ago
Alternatives and similar repositories for reddit-user-id:
Users that are interested in reddit-user-id are comparing it to the libraries listed below
- Cleans Reddit Text Data☆81Updated 4 years ago
- Read compressed NDJSON .zst files easily☆32Updated 2 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated last year
- Next generation event data ontology☆72Updated last year
- An on-going dataset consisting of hashtags, n-gram counts and other misc NLP things for covid-19 analysis, stemming from over 100 000 000…☆57Updated 3 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆53Updated 3 years ago
- Newsfeed based on GDELT Project☆23Updated 9 months ago
- A list of GDELT themes that taken together broadly represent "issues" and media source lists, a way to split GDELT sources into more conc…☆20Updated 5 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated last year
- a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sen…☆229Updated 2 years ago
- Sample notebooks for using the Global Database of Events, Language and Tone (GDELT).☆17Updated 4 years ago
- Given a set of URLs, this packages detects coordinated link sharing behavior on social media and outputs the network of entities that per…☆74Updated 6 months ago
- scraper for facebook, gab, google and tiktok☆22Updated 7 months ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- A dataset of multinational first names and last names☆26Updated last year
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated last year
- An open interface to GDELT APIs☆45Updated last year
- Turning news into events since 2014.☆51Updated 7 years ago
- Language-agnostic political event coding using universal dependencies☆18Updated 5 years ago
- Set of scripts to aid in the download of the GDELT data files from gdelt.utdallas.edu☆16Updated 10 years ago
- The twitter sentiment corpus created by Sanders Analytics, it consists of 5513 hand-classified tweets(however, 400 tweets missing due to …☆60Updated 12 years ago
- Convert Wikipedia database dumps into plaintext files☆311Updated 3 years ago
- An analysis of YouTube's political influence through recommendations.☆155Updated last year
- COVID-19 Malicious Domain Research Data☆16Updated 4 years ago
- Datasets of the daily Twitter output of Congress.☆108Updated last year
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated last year
- An end-to-end event extraction and summarization system.☆21Updated 4 years ago
- A multithread Pushshift.io API Wrapper for reddit.com comment and submission searches.☆215Updated last year
- Tools to work with the big reddit JSON data dump.☆250Updated 7 months ago
- Source code and data for paper "Neutral Bots Probe Political Bias on Social Media" by Chen et al.☆31Updated 2 years ago