Norconex / collector-filesystemLinks
Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.
☆22Updated 9 months ago
Alternatives and similar repositories for collector-filesystem
Users that are interested in collector-filesystem are comparing it to the libraries listed below
Sorting:
- A java library for creating standalone, portable, schema-full object databases supporting pagination and faceted search, and offering str…☆16Updated 8 years ago
- Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, what…☆34Updated last month
- Fast in-memory graph structure, powering Gephi☆75Updated last month
- Uses your app logs to visualize how the data moves between the code, database, HTTP services, message queue, external storages etc.☆23Updated last year
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆190Updated 3 weeks ago
- ☆25Updated 8 years ago
- Code to accompany presentation on Neo4j development practices☆21Updated 10 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆25Updated 6 years ago
- Secure REST service to index, search, retrieve and aggregate content from heterogeneous sources.☆20Updated 8 months ago
- Javascript library to talk to multiple OLAP backends from multiple frontends☆17Updated 12 years ago
- Java library for parsing semi-structured text files☆65Updated 3 years ago
- Web/FileSystem Crawler Library☆29Updated this week
- A simple CMIS 1.1 server based on chemistry opencmis☆16Updated 7 months ago
- High-security graph database☆63Updated 2 years ago
- Blazegraph Tinkerpop3 Implementation☆61Updated 4 years ago
- A java library for stored queries☆16Updated last year
- Web application to download and schedule reports from Elasticsearch☆11Updated 8 years ago
- Segrada - Semantic Graph Database☆69Updated 3 months ago
- An open source search engine for corporate data and websites.☆106Updated 7 years ago
- Zulia Search Engine☆33Updated this week
- Suite of tools for detecting changes in web pages and their rendering☆54Updated last year
- Java library for computing structural differences between XML document trees☆22Updated 10 years ago
- A generator of domain-specific language (DSL) editors for web applications and cloud IDEs.☆79Updated 2 years ago
- Open Semantic Visual Linked Data Graph Explorer: Open Source tool (web app) and user interace (UI) for discovery, exploration and visuali…☆82Updated 5 years ago
- Talend Component Kit (implementation repository)☆33Updated this week
- Core API for Silverpeas☆50Updated this week
- Index and search PDF files using Apache Lucene and PDF Box☆44Updated 4 years ago
- SOLR bulk indexing utility for the command line.☆44Updated 2 months ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Updated 7 years ago
- Mirror of Apache MetaModel Membrane☆16Updated 6 years ago