pushshift / Parallel-NDJSON-Reader
Parallel NDJSON Reader for Python
☆17 Updated 5 years ago
Alternatives and similar repositories for Parallel-NDJSON-Reader
Users who are interested in Parallel-NDJSON-Reader are comparing it to the libraries listed below.
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city. ☆62 Updated 8 years ago
- ☆32 Updated 10 years ago
- A Python Wrapper To Retrieve Data From The CrowdTangle API ☆11 Updated 4 months ago
- smappdragon is a set of tools for working with twitter data. ☆29 Updated 7 years ago
- ☆55 Updated 2 years ago
- A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling). ☆15 Updated 4 years ago
- Read compressed NDJSON .zst files easily ☆33 Updated 3 years ago
- Datasets of the daily Twitter output of Congress. ☆115 Updated 2 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 Updated 7 years ago
- pysmap is a high level interface for working with twitter data. ☆21 Updated 5 years ago
- Interpretable data visualizations for understanding how texts differ at the word level ☆282 Updated 8 months ago
- Tools for collecting social media data around focal events ☆85 Updated 3 years ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- Daily refreshed data on representation certification and unfair labor cases from nlrb.gov ☆19 Updated last month
- Text Thresher crowd sourced text annotator ☆17 Updated 7 years ago
- Guess gender from first name in Python 2 and 3 ☆137 Updated 5 months ago
- Compilation of Vega-Lite & Altair Tutorials ☆23 Updated 2 years ago
- Pushshift Telegram Ingest ☆86 Updated 6 years ago
- A browser user interface for manual labeling of record pairs. ☆47 Updated 2 years ago
- Interactive Network Graph Visualization for NDTV-generate graphs using D3 animation ☆18 Updated 10 years ago
- Tokenizer for Twitter and Reddit data ☆46 Updated 6 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 Updated 6 years ago
- A lightweight end-to-end NLP and visualization platform to make WordStream. ☆43 Updated 2 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison. ☆14 Updated 4 years ago
- Download IPEDS complete data files ☆38 Updated 7 years ago
- Package for performing Reddit-based text analysis ☆21 Updated 6 years ago
- The code processes URLs in an attempt to consolidate different web addresses that point to the same URL and to remove potentially private… ☆23 Updated 4 years ago
- MPEDS Annotation Interface ☆18 Updated 3 years ago
- Generating Wikipedia article embeddings using Word2vec and reading sessions ☆18 Updated 8 years ago
- Code to produce Russian troll mention network from data published by fivethirtyeight.com ☆29 Updated 7 years ago