commonsearch / cosr-ops
Tools for managing deployment & operations of Common Search.
☆12Updated 8 years ago
Alternatives and similar repositories for cosr-ops:
Users that are interested in cosr-ops are comparing it to the libraries listed below
- Backend of Common Search. Analyses webpages and sends them to the index.☆122Updated 7 years ago
- Modularly extensible semantic metadata validator☆83Updated 9 years ago
- Python binding for gumbo-parser using Cython☆14Updated 8 years ago
- buildstrap: when buildout+pip=♥☆16Updated 8 years ago
- "About" static website for Common Search☆11Updated 8 years ago
- Let people pay you for any or no reason.☆503Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- A very naive classifier to figure out if a sentence contains dirty words☆33Updated 9 years ago
- An autoscaling python script for Heroku☆27Updated 12 years ago
- ☆224Updated 9 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- Enhance your feature engineering workflow with Kodiak☆19Updated last year
- 🌆 TouristFriend API lets you query Google Places, Yelp and Foursquare at the same time, with Bayesian rankings!☆29Updated 6 years ago
- feedparser but faster and worse☆103Updated 3 years ago
- E-commerce scraping and analytics platform.☆52Updated 9 years ago
- Faster replacement for Python's urlparse module☆46Updated 6 years ago
- Paginating the web☆37Updated 11 years ago
- Django API Tools is an add-on which allows developers to run RESTful APIs alongside websites using Forms/Templates.☆14Updated 10 years ago
- A scrapy extension to store requests and responses information in storage service☆26Updated 2 years ago
- The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!☆41Updated 7 years ago
- Readability/Boilerpipe extraction in Python☆55Updated 8 years ago
- Python's missing statistical Swiss Army knife☆15Updated 9 years ago
- make logging fun again☆19Updated 7 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆90Updated 3 years ago
- A Django based search engine powered by CouchDB, celery and whoosh.☆49Updated 9 years ago
- Dynamic data analysis over the web. The logic to your data dashboards.☆156Updated 9 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 8 years ago
- A Scrapy pipeline to categorize items using MonkeyLearn☆38Updated 7 years ago