nytlabs / pageinfo
Python module for extracting information from web pages
☆41Updated 10 years ago
Alternatives and similar repositories for pageinfo:
Users that are interested in pageinfo are comparing it to the libraries listed below
- a set of services that provide NLP facilities☆25Updated 4 years ago
- Know more with less☆50Updated 10 years ago
- Code for Newslynx App☆22Updated 9 years ago
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample;☆192Updated 10 years ago
- A library for accessing a spreadsheet as a native Python object suitable for templating.☆225Updated 6 years ago
- NPR Visual's Carebot (deprecated, now in: https://github.com/thecarebot/carebot)☆15Updated 9 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 9 years ago
- Helper methods for generating text that conforms to "The New York Times Manual of Style and Usage"☆27Updated 10 years ago
- Little JSON object want to be graphs, too!☆17Updated 9 years ago
- A command-line and programmatic interface to various social sharecount endpoints.☆30Updated 6 years ago
- A statistics extension for Google Refine.☆33Updated 13 years ago
- A reverse part-of-speech tagger. Give it a list of tags and it spews out matching language.☆23Updated 10 years ago
- RiTaJS: A generative language toolkit for JavaScript☆43Updated 4 years ago
- ☆35Updated 14 years ago
- A ready-to-deploy system for aggregating regional boundary data (from shapefiles) and republishing that data via a RESTful JSON API.☆82Updated 3 years ago
- A deprecated Python wrapper for the DocumentCloud API☆62Updated 4 years ago
- Wrapper for TransparencyData.com API☆23Updated 11 years ago
- A self-contained example site for django-boundaryservice.☆39Updated 13 years ago
- A Python version (almost a port) of ProPublica's TableFu☆231Updated 11 years ago
- tips, tools, and tricks for building twitter bots☆17Updated 11 years ago
- Ultra simple API for geocoding a single string against various web services.☆183Updated 11 years ago
- A handy template for building a django prep sports site.☆14Updated 13 years ago
- inference and inspection on freebase data☆105Updated 10 years ago
- Open-source fork of code behind http://everyblock.com/☆99Updated 12 years ago
- A new version of the software used in the Cluetrain listicle☆19Updated 10 years ago
- Fingerpaint with your data.☆18Updated 13 years ago
- A simple transformation/data processing pipeline for CrisisNET☆15Updated 10 years ago
- State of the Unions for the rest of us☆19Updated 10 years ago
- Utilities for working with data.☆20Updated 10 years ago
- Some simple math we use to do journalism.☆78Updated 8 years ago