kaqqao / nutch-element-selector
Nutch 2.3.1 plugin for whitelisting/blacklisting specific HTML elements
☆13Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for nutch-element-selector
- Distributed Web Crawler, Parser and Search Engine.☆10Updated 8 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 2 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Updated 8 years ago
- Collects multimedia content shared through social networks.☆19Updated 9 years ago
- Twitter-Kafka Data Pipeline☆16Updated this week
- Open Source Social Media Monitoring And Engagement System Core/API☆36Updated 10 years ago
- Node.js based proxy to make a solr instance read-only.☆27Updated 8 years ago
- Elasticsearch REPL built on top of Jest☆23Updated 9 years ago
- This plugin provides a useful feature for multi-language☆13Updated 2 years ago
- Performance dashboard☆19Updated last month
- ☆14Updated 9 years ago
- The complete Buddycloud stack in a VM☆23Updated 8 years ago
- Storm / Solr Integration☆19Updated 9 months ago
- The first Open Source document analysis platform☆65Updated 3 years ago
- Document Imaging Archive System. Home document imaging, with OCR. Scan documents (with SANE) or import ODF documents, assign tags. Use op…☆24Updated 9 years ago
- M-ATOLL: A Framework for the Lexicalization of Ontologies in Multiple Languages☆10Updated 7 years ago
- ☆10Updated 7 years ago
- VoltDB Click Stream Processing Example.☆16Updated 6 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 8 years ago
- Silk is a port of Kibana 4 project.☆69Updated 8 years ago
- Usage examples for Divolte collector☆17Updated 7 years ago
- Highly performant version of open-text-summarizer☆38Updated 10 years ago
- The next generation of open source search☆91Updated 7 years ago
- Secure REST service to index, search, retrieve and aggregate content from heterogeneous sources.☆19Updated last month
- Full-stack monitoring and alerting Python library.☆15Updated 3 years ago
- d3 based visualization library - svg & canvas☆14Updated 7 years ago
- Compiler for writing DeepDive applications in a Datalog-like language — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👇🏿☆19Updated 7 years ago
- Code to index HDFS to Solr using MapReduce☆51Updated 5 years ago
- A Storm based web crawler with Cassandra backend☆28Updated 11 years ago