pushshift / zreader
Read compressed NDJSON .zst files easily
☆32Updated 2 years ago
Alternatives and similar repositories for zreader:
Users that are interested in zreader are comparing it to the libraries listed below
- Pushshift Telegram Ingest☆86Updated 5 years ago
- Comprehensive database of ratings for 11k news domains☆25Updated last year
- A Python Wrapper To Retrieve Data From The CrowdTangle API☆11Updated 11 months ago
- Cleans Reddit Text Data☆83Updated 5 years ago
- Official repository for the ICWSM '21 paper "More than meets the tie: Examining the Role of Interpersonal Relationships in Social Network…☆12Updated 2 years ago
- Quick implementation of Monroe et al.'s algorithm for comparing languages☆53Updated 4 years ago
- ☆22Updated 4 years ago
- Script to extract highly probable bots for further analysis☆12Updated 7 years ago
- Tokenizer for Twitter and Reddit data☆47Updated 6 years ago
- Next generation event data ontology☆73Updated last year
- Classification of incivility in Reddit posts☆19Updated 4 years ago
- ☆31Updated 9 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 3 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆53Updated 3 years ago
- Additional material for the paper "MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations Prediction"☆54Updated 2 years ago
- The documentation and scripts for the Local News Dataset☆25Updated 3 years ago
- Source code and data for paper "Neutral Bots Probe Political Bias on Social Media" by Chen et al.☆31Updated 3 years ago
- Using stochastic block models for topic modeling☆195Updated last year
- Memes Processing Pipeline that enables the track of memes across multiple Web communities.☆57Updated 5 years ago
- Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings"☆68Updated 2 years ago
- Interpretable data visualizations for understanding how texts differ at the word level☆275Updated 2 months ago
- Public repository containing the dataset and code for training the models in "Ten Social Dimensions of Conversations and Relationships" (…☆14Updated 4 years ago
- DEPRECATED - The Concept Mover's Distance Method is now available in the text2map package. Concept Mover's Distance is a way to measure…☆27Updated 3 years ago
- Fetch movie data from IMDB and output in JSON format.☆10Updated 4 years ago
- Repository of data on web domains.☆17Updated last year
- Convert text-intensive ICEWS data on Dataverse to conventional ISO-3166 and CAMEO codes☆14Updated 4 years ago
- Harassment Lexicon and Corpus☆30Updated 6 years ago
- Repository for public code and data associated with the paper "Fake News on Twitter During the 2016 U.S. Presidential Election☆12Updated 5 years ago
- A library that will eventually help people wanting to do Data Mining on Twitter☆22Updated 2 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 6 years ago