scalingexcellence / scrapy-solr
Scrapy pipeline that stores Scrapy items in a Solr server.
☆18 Updated 9 years ago
Alternatives and similar repositories for scrapy-solr
Users who are interested in scrapy-solr are comparing it to the libraries listed below.
- Scrapes sites. Gets news. Eventually events. ☆87 Updated 9 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs). ☆90 Updated 3 years ago
- A Scrapy pipeline which sends items to an Elasticsearch server ☆98 Updated 7 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆99 Updated 2 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆47 Updated 3 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆57 Updated last year
- Pure Python script that takes a user query and summarizes news related to it. ☆25 Updated 3 years ago
- A Python library to detect and extract listing data from an HTML page. ☆108 Updated 8 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine ☆73 Updated 8 years ago
- Automatically extracts and normalizes an online article or blog post publication date ☆117 Updated 2 years ago
- ScraperWiki Python library for scraping and saving data ☆159 Updated 2 years ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆15 Updated 7 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆65 Updated 8 years ago
- A command-line and programmatic interface to various social sharecount endpoints. ☆30 Updated 6 years ago
- Tools for tracking stories on news homepages ☆48 Updated 5 years ago
- ☆59 Updated 3 years ago
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- Parse Popolo JSON data and navigate it with Python ☆15 Updated 5 years ago
- An online sentiment analyzer built with Flask and TextBlob ☆15 Updated 11 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors ☆35 Updated 10 years ago
- Scrapes public information off of LinkedIn ☆111 Updated 9 years ago
- Some Scrapy and web.py examples ☆79 Updated 8 years ago
- Demo of the Newspaper article extraction library. ☆29 Updated 10 years ago
- Matches a category of Google's Taxonomy to a product that is described in any kind of text data ☆62 Updated 7 years ago
- Python bindings to the Compact Language Detector ☆33 Updated 5 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED] ☆24 Updated 8 years ago
- A helper to create web scrapers using Scrapy selectors in a model-based structure ☆57 Updated 2 years ago
- Seer Interactive's public collection of functions for Google Docs Spreadsheets ☆68 Updated 8 years ago
- MongoDB extensions for Scrapy ☆44 Updated 10 years ago
- Data Server for Topic Models ☆121 Updated 2 years ago