commonsearch / cosr-ops
Tools for managing deployment & operations of Common Search.
☆12Updated 8 years ago
Alternatives and similar repositories for cosr-ops:
Users that are interested in cosr-ops are comparing it to the libraries listed below
- Backend of Common Search. Analyses webpages and sends them to the index.☆122Updated 7 years ago
- Python binding for gumbo-parser using Cython☆14Updated 8 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- Frontend of Common Search. Go server for fetching and rendering results + HTML5 UI to browse them.☆59Updated 8 years ago
- Modularly extensible semantic metadata validator☆84Updated 9 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated 10 years ago
- "About" static website for Common Search☆11Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors☆34Updated 10 years ago
- Demo code for learning_text_transformer☆25Updated 10 years ago
- Paginating the web☆37Updated 11 years ago
- An autoscaling python script for Heroku☆27Updated 12 years ago
- The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!☆39Updated 7 years ago
- ☆223Updated 10 years ago
- Readability/Boilerpipe extraction in Python☆55Updated 8 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆90Updated 3 years ago
- Suma, microservice to manage external links☆46Updated 7 years ago
- A scrapy extension to store requests and responses information in storage service☆26Updated 3 years ago
- A polite, minimal interface for sending python objects to and from Amazon S3.☆57Updated 9 years ago
- Python library with common functionality for writing web scrapers☆102Updated 9 years ago
- Nefertari is a REST API framework sitting on top of Pyramid and ElasticSearch☆53Updated 5 years ago
- Access predefined IMAP mailboxes with a browser using one time passwords or a YubiKey☆17Updated 8 years ago
- Commit Counter Chart is a Python Flask app to view git history using D3.js☆39Updated 9 years ago
- Let people pay you for any or no reason.☆504Updated 8 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- Wikipedia Live Monitor☆21Updated 4 months ago
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago
- Feed discovery to share :)☆41Updated 8 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- A command line replacement for zapier and ifttt.☆39Updated 7 years ago