nytlabs / pageinfo
Python module for extracting information from web pages
☆42Updated 10 years ago
Related projects: ⓘ
- A reverse part-of-speech tagger. Give it a list of tags and it spews out matching language.☆23Updated 9 years ago
- a set of services that provide NLP facilities☆25Updated 3 years ago
- A polite, minimal interface for sending python objects to and from Amazon S3.☆57Updated 8 years ago
- Know more with less☆50Updated 9 years ago
- A command line utility for generating Google Analytics reports that are straightforward to compare across domains, projects or pages.☆41Updated 3 years ago
- A Python version (almost a port) of ProPublica's TableFu☆231Updated 11 years ago
- Pollster polls for share counts of URLs at regular intervals.☆47Updated 8 years ago
- Wrapper for TransparencyData.com API☆23Updated 10 years ago
- Utilities for working with data.☆18Updated 9 years ago
- JSON export and uploading extension for Google Refine☆29Updated 13 years ago
- RiTaJS: A generative language toolkit for JavaScript☆44Updated 3 years ago
- a web based tool to monitor how your website content is used in wikipedia☆37Updated 3 years ago
- Helper methods for generating text that conforms to "The New York Times Manual of Style and Usage"☆27Updated 10 years ago
- A library for accessing a spreadsheet as a native Python object suitable for templating.☆225Updated 6 years ago
- A Python module to access Pinboard.in via its API. This is a fork/modification of mudge/python-delicious☆168Updated 9 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 8 years ago
- RGP -- Redis Graph via Python☆30Updated 9 years ago
- Dynamic Deep-Linking and Highlighting☆576Updated 9 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 9 years ago
- Neddick: Open Source Information Discovery Platform☆36Updated last year
- A lightweight Python framework for building cli-inspired Slack bots.☆71Updated last year
- Data Pipes for CSV☆117Updated last year
- Publish spreadsheets as interactive tables. And do it on deadline.☆74Updated 7 years ago
- Modularly extensible semantic metadata validator☆83Updated 8 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors☆34Updated 9 years ago
- Like Tabletop.js — but for Google Docs!☆65Updated 8 years ago
- moxie☆28Updated 8 years ago
- Little JSON object want to be graphs, too!☆17Updated 8 years ago
- Ultra simple API for geocoding a single string against various web services.☆183Updated 10 years ago
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample;☆193Updated 10 years ago