codelibs / fess-crawlerLinks
Web/FileSystem Crawler Library
☆29Updated this week
Alternatives and similar repositories for fess-crawler
Users that are interested in fess-crawler are comparing it to the libraries listed below
Sorting:
- ImageCat is an Apache OODT RADIX application that uses Apache Solr, Apache Tika and Apache OODT to ingest 10s of millions of files (image…☆96Updated 6 years ago
- Document Enrichment plugin for Elasticsearch☆28Updated 4 months ago
- Elasticsearch plugin offering Neo4j integration for Personalized Search☆156Updated 4 years ago
- Skeleton for Meetup - Building your own recommendation engine in an hour☆29Updated 4 years ago
- Suite of tools for detecting changes in web pages and their rendering☆54Updated last year
- This plugin provides a useful feature for multi-language☆14Updated 3 years ago
- Neo4j ElasticSearch Integration☆213Updated 4 years ago
- Distributed text analysis suite based on Celery☆96Updated 2 years ago
- Solr Relevance Ranking Analysis and Visualization Tool☆17Updated 5 years ago
- Elasticsearch plugin for b-bit minhash algorism☆63Updated last year
- open source big data integration, analytics, and visualization☆418Updated 8 years ago
- The tool which imports raw JSON to ElasticSearch in one line of commands☆67Updated 6 years ago
- Apache NiFi NLP Processor☆18Updated last year
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- Pulsar Data Visualization, gets the data from Pulsar Reporting API, builds different charts and displays them in the browser.☆53Updated 9 years ago
- Vert.x web and commandline application to import CSV/XLS/XLSX files into ElasticSearch.☆119Updated 4 years ago
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆108Updated 4 months ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- Integration between Stanford NLP and Apache Stanbol☆34Updated 9 years ago
- An open source search engine for corporate data and websites.☆106Updated 8 years ago
- A quick Elasticsearch/Logstash/Kibana (ELK) 7.x environment to quickly ingest realtime filtered tweets, perform Natural Language Processi…☆16Updated last year
- Tools for iterative knowledge base development with DeepDive☆120Updated 6 years ago
- This plugin provides a feature to change top N documents in a search result.☆56Updated 2 years ago
- A web based data mining workflow platform with real-time analysis capabilities☆49Updated 2 years ago
- Implementation of Vision Based Page Segmentation algorithm in Java☆102Updated 5 years ago
- Building recommenders with Elastic Graph!☆37Updated 4 years ago
- Easy way to get structured stuff into Elasticsearch (CSV, MSSQL, API)☆88Updated 5 years ago
- A custom SimilarityProvider example for Elasticsearch☆36Updated 9 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Updated last year
- Clone version of LingPipe 4.1.0, with support for unsupervised training☆32Updated 11 years ago