nytlabs / pageinfoLinks
Python module for extracting information from web pages
☆41Updated 11 years ago
Alternatives and similar repositories for pageinfo
Users that are interested in pageinfo are comparing it to the libraries listed below
Sorting:
- Know more with less☆50Updated 11 years ago
- A command-line and programmatic interface to various social sharecount endpoints.☆30Updated 7 years ago
- a set of services that provide NLP facilities☆25Updated 5 years ago
- RiTaJS: A generative language toolkit for JavaScript☆43Updated 5 years ago
- A reverse part-of-speech tagger. Give it a list of tags and it spews out matching language.☆23Updated 10 years ago
- PANDA: A Newsroom Data Appliance☆208Updated 3 years ago
- A library for accessing a spreadsheet as a native Python object suitable for templating.☆226Updated 7 years ago
- A Python version (almost a port) of ProPublica's TableFu☆230Updated 12 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Updated 4 years ago
- Little JSON object want to be graphs, too!☆17Updated 10 years ago
- Some simple math we use to do journalism.☆79Updated 9 years ago
- Dot's Pot Ting☆81Updated 10 years ago
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample;☆193Updated 11 years ago
- NPR Visual's Carebot (deprecated, now in: https://github.com/thecarebot/carebot)☆15Updated 10 years ago
- Akara is an open-source (Apache2 license) Web framework specialized for RESTful data services, especially involving XML and other semi-st…☆25Updated 12 years ago
- a web based tool to monitor how your website content is used in wikipedia☆37Updated 5 years ago
- An attempt at creating a gold standard dataset for backtesting yesterday & today's content-extractors☆35Updated 10 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED]☆102Updated 10 years ago
- A deprecated Python wrapper for the DocumentCloud API☆62Updated 5 years ago
- A contextual news development environment.☆49Updated 11 years ago
- Stylesheets for clean geographic data visualization.☆56Updated 8 years ago
- Wrapper for TransparencyData.com API☆23Updated 11 years ago
- Open-source fork of code behind http://everyblock.com/☆99Updated 13 years ago
- backchan.nl is a tool for involving audiences in presentations by letting them suggest questions and vote on each other's questions.☆30Updated 13 years ago
- legacy backend for Open States☆87Updated 5 years ago
- Helper methods for generating text that conforms to "The New York Times Manual of Style and Usage"☆27Updated 11 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆12Updated 5 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆85Updated 2 years ago
- A statistics extension for Google Refine.☆32Updated 14 years ago
- A self-contained example site for django-boundaryservice.☆38Updated 14 years ago