nytlabs / pageinfoLinks
Python module for extracting information from web pages
☆41Updated 10 years ago
Alternatives and similar repositories for pageinfo
Users that are interested in pageinfo are comparing it to the libraries listed below
Sorting:
- A reverse part-of-speech tagger. Give it a list of tags and it spews out matching language.☆23Updated 10 years ago
- A library for accessing a spreadsheet as a native Python object suitable for templating.☆225Updated 6 years ago
- Helper methods for generating text that conforms to "The New York Times Manual of Style and Usage"☆27Updated 11 years ago
- Know more with less☆50Updated 10 years ago
- A simple transformation/data processing pipeline for CrisisNET☆15Updated 10 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- A command line utility for generating Google Analytics reports that are straightforward to compare across domains, projects or pages.☆41Updated 4 years ago
- Some simple math we use to do journalism.☆78Updated 8 years ago
- Publish spreadsheets as interactive tables. And do it on deadline.☆74Updated 8 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors☆35Updated 10 years ago
- A hacky Markdown-to-JSON parser for easier copy editing.☆25Updated 9 years ago
- Bash-style pipelining for Python generators.☆17Updated 14 years ago
- A statistics extension for Google Refine.☆33Updated 13 years ago
- A Python version (almost a port) of ProPublica's TableFu☆231Updated 11 years ago
- Wrapper for TransparencyData.com API☆23Updated 11 years ago
- A handy template for building a django prep sports site.☆14Updated 13 years ago
- backchan.nl is a tool for involving audiences in presentations by letting them suggest questions and vote on each other's questions.☆30Updated 12 years ago
- A Django-based open source CMS for newspapers☆16Updated 13 years ago
- A self-contained example site for django-boundaryservice.☆39Updated 14 years ago
- Whippersnapper is an automated screenshot tool to keep a visual history of content on the web.☆55Updated 9 years ago
- Analysis for a blog post on cartograms.☆29Updated 11 months ago
- Little JSON object want to be graphs, too!☆17Updated 9 years ago
- Saving activists time, one bot at a time.☆37Updated 8 years ago
- ☆36Updated 7 years ago
- An interactive infographic showing how HBO loves to reuse actors.☆36Updated last year
- Data and analysis supporting several passages in the BuzzFeed News article, "The New American Slavery: Invited To The U.S., Foreign Worke…☆28Updated 8 years ago
- Manage your dataset downloads.☆43Updated 8 years ago
- HiiDef web spider framework, powers http://flavors.me☆18Updated 13 years ago
- Neddick: Open Source Information Discovery Platform☆36Updated 2 years ago
- State of the Unions for the rest of us☆19Updated 10 years ago