nytlabs / pageinfo
Python module for extracting information from web pages
☆41 Updated 11 years ago
Alternatives and similar repositories for pageinfo
Users who are interested in pageinfo are comparing it to the libraries listed below.
- a set of services that provide NLP facilities ☆25 Updated 4 years ago
- A command-line and programmatic interface to various social sharecount endpoints. ☆30 Updated 7 years ago
- PANDA: A Newsroom Data Appliance ☆207 Updated 3 years ago
- RiTaJS: A generative language toolkit for JavaScript ☆43 Updated 4 years ago
- Helper methods for generating text that conforms to "The New York Times Manual of Style and Usage" ☆27 Updated 11 years ago
- NPR Visual's Carebot (deprecated, now in: https://github.com/thecarebot/carebot) ☆15 Updated 10 years ago
- Social network for hypothesis formation, evidence collection, and collective decision-making. ☆40 Updated 13 years ago
- Neddick: Open Source Information Discovery Platform ☆36 Updated 2 years ago
- A library for accessing a spreadsheet as a native Python object suitable for templating. ☆226 Updated 7 years ago
- Know more with less ☆50 Updated 10 years ago
- A Python version (almost a port) of ProPublica's TableFu ☆230 Updated 12 years ago
- webstore is a web-api enabled datastore backed onto sql databases especially sqlite. It supports the RESTful JSON APIs standard to nosql … ☆40 Updated 6 years ago
- Command line tool for manipulating and analyzing text ☆29 Updated 3 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3. ☆15 Updated 10 years ago
- TweeQL is a Query Language for Tweets: SELECT brand(text) AS brand, sentiment(text) AS sentiment FROM twitter_sample; ☆193 Updated 11 years ago
- Wrapper for TransparencyData.com API ☆23 Updated 11 years ago
- Akara is an open-source (Apache2 license) Web framework specialized for RESTful data services, especially involving XML and other semi-st… ☆25 Updated 11 years ago
- An AIML alternative, YAML based. Aerolito works like a simulation of natural language processing. ☆20 Updated 14 years ago
- Monitor datasets, gets alerts when something happens ☆210 Updated 7 years ago
- Dot's Pot Ting ☆81 Updated 10 years ago
- A contextual news development environment. ☆49 Updated 10 years ago
- A polite, minimal interface for sending python objects to and from Amazon S3. ☆57 Updated 9 years ago
- ☆154 Updated 14 years ago
- Parser and standardizer for politician, individual and organization names. ☆129 Updated 8 years ago
- DEPRECATED - Development on PopIt has stopped and it is no longer being maintained ☆76 Updated 8 years ago
- Data Pipes for CSV ☆115 Updated 2 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors ☆35 Updated 10 years ago
- Code for Newslynx App ☆22 Updated 10 years ago
- rapid nlp prototyping ☆71 Updated 3 years ago
- A reverse part-of-speech tagger. Give it a list of tags and it spews out matching language. ☆23 Updated 10 years ago