Norconex / collector-filesystemLinks
Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.
☆22Updated 10 months ago
Alternatives and similar repositories for collector-filesystem
Users that are interested in collector-filesystem are comparing it to the libraries listed below
Sorting:
- an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)☆54Updated 7 years ago
- The open source tools for building, maintaining and deploying Topic Maps-based applications.☆57Updated last month
- a pure javascript frontend for ElasticSearch search indices.☆80Updated 7 years ago
- Visualization of interaction between entities☆16Updated 8 years ago
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated last week
- Neddick: Open Source Information Discovery Platform☆36Updated 2 years ago
- Work in progress: a new visualization engine☆34Updated last year
- Explore networks and publish narratives.☆53Updated 4 years ago
- Zorba - the NoSQL processor☆42Updated last year
- JSONiq Implementation that compiles to JavaScript☆66Updated 3 years ago
- Suite of tools for detecting changes in web pages and their rendering☆54Updated last year
- A web application for digital assets management.☆52Updated 4 years ago
- Browser version of Hyphe (WIP)☆31Updated 2 months ago
- Uses your app logs to visualize how the data moves between the code, database, HTTP services, message queue, external storages etc.☆23Updated last year
- The HTML5 PivotViewer is a fork of a project that was started by LobsterPot Solutions as a cross browser, cross platform version of the S…☆123Updated 7 months ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆45Updated 3 years ago
- Eve is a multipurpose, web based agent platform☆38Updated 10 years ago
- The Open Semantic Framework (OSF) is a generally RESTful middleware API layer that provides the bridge between existing content, structur…☆49Updated 7 years ago
- SOLR bulk indexing utility for the command line.☆44Updated last month
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆270Updated 2 years ago
- The Openlink Structured Data Sniffer (OSDS) is a plugin for the Chrome, Firefox and Opera browsers that detects and shows structured data…☆125Updated 3 years ago
- Docker container to provide Apache Tika RESTful API☆41Updated 9 years ago
- ☆36Updated last year
- Blazegraph Tinkerpop3 Implementation☆62Updated 4 years ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p…☆22Updated 10 years ago
- Callimachus is a highly scalable platform for creating and running data-driven websites☆95Updated 8 years ago
- Solr client and user interface for search☆22Updated last year
- This is the facade for installation and access to the individual components☆15Updated 7 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Updated 9 years ago
- Fast in-memory graph structure, powering Gephi☆75Updated this week