codelibs / fess-crawler
Web/FileSystem Crawler Library
☆34Updated last week
Alternatives and similar repositories for fess-crawler
Users who are interested in fess-crawler are comparing it to the libraries listed below.
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆196Updated this week
- Web Crawler for Elasticsearch☆235Updated 6 years ago
- Pulsar Data Visualization, gets the data from Pulsar Reporting API, builds different charts and displays them in the browser.☆53Updated 10 years ago
- Suite of tools for detecting changes in web pages and their rendering☆55Updated 2 years ago
- Vert.x web and commandline application to import CSV/XLS/XLSX files into ElasticSearch.☆119Updated 5 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34Updated 2 years ago
- Apache NiFi NLP Processor☆18Updated 2 years ago
- Apache OpenNLP Sandbox☆46Updated last week
- This project deals with hierarchical classification of web pages based on dmoz dataset.☆14Updated 11 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 4 years ago
- Integration between Stanford NLP and Apache Stanbol☆34Updated 9 years ago
- Develop streaming applications for IBM Streams in Python, Java & Scala.☆28Updated 3 years ago
- Skeleton for Meetup - Building your own recommendation engine in an hour☆29Updated 4 years ago
- Mirror of Apache ManifoldCF☆82Updated 2 weeks ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- Machine learning components for Apache UIMA☆132Updated 2 years ago
- Mirror of Apache OpenNLP Add-ons☆19Updated last week
- The next generation of open source search☆93Updated 8 years ago
- An open source search engine for corporate data and websites.☆108Updated 8 years ago
- OptaPlanner workbench 7.x: OptaPlanner extensions to the KIE Workbench☆24Updated 2 years ago
- Uses Apache Lucene, OpenNLP and geonames and extracts locations from text and geocodes them.☆38Updated last year
- open source big data integration, analytics, and visualization☆421Updated 8 years ago
- The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)☆222Updated 3 years ago
- Elasticsearch plugin for b-bit minhash algorithm☆62Updated last year
- Twitter sentiment analysis using Spark and Stanford CoreNLP and visualization using elasticsearch and kibana☆20Updated 8 years ago
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N…☆277Updated 3 years ago
- Sensefy is a federated enterprise semantic search framework built on Apache ManifoldCF, Apache Solr and Apache Stanbol. Development is sp…☆15Updated 3 years ago
- SQL interface for SolrCloud☆40Updated 3 years ago
- Easy way to get structured stuff into Elasticsearch (CSV, MSSQL, API)☆88Updated 5 years ago
- ☆11Updated 10 years ago