kaqqao / nutch-element-selector
Nutch 2.3.1 plugin for whitelisting/blacklisting specific HTML elements
☆13 Updated 2 years ago
Alternatives and similar repositories for nutch-element-selector:
Users who are interested in nutch-element-selector are comparing it to the libraries listed below.
- Distributed Web Crawler, Parser and Search Engine. ☆10 Updated 8 years ago
- Vizlinc ☆14 Updated 9 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This … ☆16 Updated 8 years ago
- HTTP Shell is a CLI tool based on the Kui framework that provides developers a modern alternative to http clients for interacting with AP… ☆12 Updated 4 years ago
- ☆20 Updated 7 years ago
- ☆48 Updated 7 years ago
- Hadoop MapReduce over Hive based implementation of attributed network pattern matching. ☆40 Updated 10 years ago
- Node.js based proxy to make a solr instance read-only. ☆27 Updated 8 years ago
- fuzzydb is a fuzzy matching database engine capable of providing human-like search results that make life much easier for users of websit… ☆19 Updated last year
- Compiler for writing DeepDive applications in a Datalog-like language — ⚠️🚧🛑 REPO MOVED TO DEEPDIVE 👇🏿 ☆19 Updated 7 years ago
- Focused Crawler for VT's CTRNet ☆10 Updated 11 years ago
- Sample custom Nifi processor to process tcpdump ☆18 Updated 9 years ago
- Collects multimedia content shared through social networks. ☆19 Updated 9 years ago
- Big GeoSpatial Data Points Visualization Tool ☆19 Updated 8 years ago
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It … ☆15 Updated 8 years ago
- Masques is a distributed social network. ☆36 Updated 8 years ago
- Pattern-of-Behavior Search Tool ☆11 Updated 2 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess… ☆15 Updated 9 years ago
- ☆13 Updated 9 years ago
- Home of RDF2Go and RDFReactor ☆13 Updated 8 years ago
- Scraper built with Scrapy. ☆14 Updated 5 months ago
- Sandbox for Apache nifi ☆24 Updated 3 years ago
- Wikipedia River Plugin for elasticsearch (STOPPED) ☆74 Updated last year
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED] ☆23 Updated 8 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases ☆13 Updated 9 years ago
- M-ATOLL: A Framework for the Lexicalization of Ontologies in Multiple Languages ☆10 Updated 7 years ago
- iCQA - Intelligent Community Question Answering Framework ☆32 Updated 8 years ago