scalingexcellence / scrapy-solr
Scrapy pipeline which allows you to store scrapy items in a solr server.
☆19Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for scrapy-solr
- ☆59Updated 3 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated 9 months ago
- Detect and classify pagination links☆14Updated 4 years ago
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago
- A scrapy pipeline which send items to Elastic Search server☆98Updated 6 years ago
- Find which links on a web page are pagination links☆29Updated 7 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆34Updated 8 years ago
- API - extract a list of keywords from a text.☆18Updated 7 years ago
- Scrapy downloader middleware that stores response HTMLs to disk.☆18Updated 6 months ago
- Virtual patent marking crawler at iproduct.epfl.ch