Norconex / collector-filesystem
Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.
☆22Updated 6 months ago
Alternatives and similar repositories for collector-filesystem:
Users that are interested in collector-filesystem are comparing it to the libraries listed below
- Open Source, Distributed, Big Data Enterprise Search Engine☆69Updated 2 weeks ago
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆187Updated this week
- Uses your app logs to visualize how the data moves between the code, database, HTTP services, message queue, external storages etc.☆23Updated 11 months ago
- Index and search PDF files using Apache Lucene and PDF Box☆43Updated 4 years ago
- Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, what…☆34Updated 5 months ago
- Javascript library to talk to multiple OLAP backends from multiple frontends☆17Updated 12 years ago
- Eve is a multipurpose, web based agent platform☆38Updated 10 years ago
- A small Docker built for the OCRopus OCR system.☆20Updated 7 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Updated 3 years ago
- Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS☆86Updated 7 years ago
- SPARQLGraph - Visual Query Builder for Biological RDF Databases☆16Updated 9 years ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆24Updated 7 years ago
- Fast in-memory graph structure, powering Gephi☆74Updated 4 months ago
- Merge Dirty Data with Clean Reference Tables☆35Updated 3 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆25Updated 6 years ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services☆26Updated last month
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆45Updated 3 years ago
- Automatically exported from code.google.com/p/xml2json-xslt☆38Updated 9 years ago
- Solr Relevance Ranking Analysis and Visualization Tool☆17Updated 5 years ago
- Tools for exploring the contents of web archive files.☆39Updated 4 years ago
- Mobi is a decentralized, federated, and distributed graph data platform for teams and communities to publish and discover data, data mode…☆47Updated 2 months ago
- Skeleton for Meetup - Building your own recommendation engine in an hour☆29Updated 3 years ago
- An open source search engine for corporate data and websites.☆106Updated 7 years ago
- Crawling github data☆29Updated last year
- Work in progress: a new visualization engine☆34Updated 9 months ago
- This is the facade for installation and access to the individual components☆15Updated 6 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- A Node.js tool to examine the correctness of Open Data Metadata and build custom dataset profiles☆12Updated last year
- Querqy for Elasticsearch☆45Updated 3 weeks ago
- Open Semantic Search Appliance (VM)☆12Updated 4 years ago