Norconex / collector-filesystem
Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.
☆22Updated last month
Related projects ⓘ
Alternatives and complementary repositories for collector-filesystem
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆24Updated 7 years ago
- The MIL-STD-498 DIDs converted to HTML.☆24Updated 12 years ago
- SOLR bulk indexing utility for the command line.☆45Updated 3 months ago
- Simple search results with Solr and EmberJS☆58Updated 5 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆25Updated 6 years ago
- Work in progress: a new visualization engine☆34Updated 5 months ago
- Convert git commit history to solr index☆73Updated 11 years ago
- Netarchivesuite 5.X development☆19Updated last month
- Web/FileSystem Crawler Library☆29Updated this week
- Node.js based proxy to make a solr instance read-only.☆27Updated 8 years ago
- Open Source, Distributed, Big Data Enterprise Search Engine☆69Updated last week
- Skeleton for Meetup - Building your own recommendation engine in an hour☆29Updated 3 years ago
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation.☆11Updated 6 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Updated 2 years ago
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆183Updated this week
- Uses your app logs to visualize how the data moves between the code, database, HTTP services, message queue, external storages etc.☆23Updated 7 months ago
- Secure REST service to index, search, retrieve and aggregate content from heterogeneous sources.☆19Updated last month
- GUI tool to map any JSON-based Web API, plus node server to access it as if it were a HAL Hypermedia API☆27Updated 6 years ago
- Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS☆86Updated 7 years ago
- Cuttlefish aims to be a highly extensible visualization and analysis platform for all kinds of network data☆17Updated 7 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆46Updated 2 years ago
- Enterprise backend as a service☆70Updated 6 years ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services☆25Updated 5 months ago
- Web application to download and schedule reports from Elasticsearch☆11Updated 7 years ago
- Blazegraph Tinkerpop3 Implementation☆60Updated 4 years ago
- Docker container to provide Apache Tika RESTful API☆40Updated 8 years ago
- Solr Relevance Ranking Analysis and Visualization Tool☆17Updated 5 years ago
- Core API for Silverpeas☆49Updated this week
- Easy generation of dummy Neo4j graphs with PHP☆34Updated 4 years ago