pushshift / Parallel-NDJSON-Reader
Parallel NDJSON Reader for Python
☆17 Updated 6 years ago
Alternatives and similar repositories for Parallel-NDJSON-Reader
Users that are interested in Parallel-NDJSON-Reader are comparing it to the libraries listed below
- ☆76 Updated this week
- A library that enables you to easily parse and transform ORCID metadata between XML, JSON and Java objects ☆20 Updated 4 years ago
- Read compressed NDJSON .zst files easily ☆35 Updated 3 years ago
- A simple command line interface to the datamade/dedupe library. ☆43 Updated 3 years ago
- Interpretable data visualizations for understanding how texts differ at the word level ☆286 Updated 11 months ago
- A lightweight end-to-end NLP and visualization platform to make WordStream. ☆43 Updated 2 years ago
- Python client for The Guardian API ☆73 Updated last year
- Group thousands of similar spreadsheet or database text entries in seconds ☆157 Updated 2 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 Updated 7 years ago
- A browser user interface for manual labeling of record pairs. ☆48 Updated 2 years ago
- Ensemble topic modelling with pLSA ☆114 Updated 4 years ago
- Force-Atlas 2 graph layout in networkx ☆22 Updated 11 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆15 Updated 6 years ago
- Public repository containing the dataset and code for training the models in "Ten Social Dimensions of Conversations and Relationships" (… ☆14 Updated 4 years ago
- The documentation and scripts for the Local News Dataset ☆25 Updated 3 years ago
- Text Thresher crowd sourced text annotator ☆17 Updated 8 years ago
- Tools for collecting social media data around focal events ☆85 Updated 3 years ago
- Daily refreshed data on representation certification and unfair labor cases from nlrb.gov ☆20 Updated 2 months ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated last month
- ☆32 Updated 10 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API. ☆66 Updated 3 years ago
- Datasets of the daily Twitter output of Congress. ☆115 Updated 2 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city. ☆62 Updated 9 years ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory. ☆78 Updated 2 months ago
- Dataframe Integration with spaCy. ☆103 Updated 4 years ago
- ☆37 Updated 7 years ago
- Compilation of Vega-Lite & Altair Tutorials ☆23 Updated 2 years ago
- Turning news into events since 2014. ☆51 Updated 8 years ago
- Using stochastic block models for topic modeling ☆198 Updated last year
- Interactive Network Graph Visualization for NDTV-generated graphs using D3 animation ☆18 Updated 10 years ago