pushshift / Parallel-NDJSON-Reader
Parallel NDJSON Reader for Python
☆17Updated 5 years ago
Alternatives and similar repositories for Parallel-NDJSON-Reader
Users who are interested in Parallel-NDJSON-Reader are comparing it to the libraries listed below.
- pysmap is a high-level interface for working with Twitter data.☆21Updated 5 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- ☆32Updated 10 years ago
- ☆74Updated last week
- Text Thresher crowdsourced text annotator☆17Updated 7 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- Read compressed NDJSON .zst files easily☆33Updated 3 years ago
- Utilities for retrieving whitehouse.gov transcripts and matching news quotes to them☆16Updated 10 years ago
- MPEDS Annotation Interface☆18Updated 3 years ago
- Interpretable data visualizations for understanding how texts differ at the word level☆280Updated 7 months ago
- Collecting thoughts about data versioning☆108Updated 6 years ago
- (1) Input a network. (2) Style it. (3) Download the result.☆28Updated 5 years ago
- Render NumPy arrays as HTML tables☆40Updated 3 years ago
- Turning news into events since 2014.☆51Updated 8 years ago
- A browser user interface for manual labeling of record pairs.☆47Updated 2 years ago
- Guess gender from first name in Python 2 and 3☆137Updated 3 months ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆157Updated 2 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 4 years ago
- Scrape comments, including their replies, from a YouTube video.☆39Updated 4 years ago
- Python client for The Guardian API☆73Updated last year
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Open Source Proxy Demographic module written in Python☆36Updated last year
- JSON data set of dream reports, scraped from DreamBank☆21Updated 2 years ago
- Ensemble topic modelling with pLSA☆114Updated 3 years ago
- Package for performing Reddit-based text analysis☆21Updated 6 years ago
- An implementation of latent Dirichlet allocation in JavaScript☆185Updated 3 years ago
- A multi-modal Twitter dataset with 7.6M tweets and 25.6M retweets related to voter fraud claims.☆53Updated 3 years ago
- Inspired by John Foreman. Created by the crowds.☆54Updated last year
- smappdragon is a set of tools for working with Twitter data.☆29Updated 7 years ago