andybab / OfflineESIndexGenerator
Offline Elasticsearch index generator
☆26 Updated 3 years ago
Alternatives and similar repositories for OfflineESIndexGenerator:
Users who are interested in OfflineESIndexGenerator are comparing it to the repositories listed below
- Offline Hadoop Elasticsearch Index Building and Tools For Lambda Architectures ☆31 Updated last year
- Interactive Audience Analytics with Spark and HyperLogLog ☆55 Updated 9 years ago
- Docker containers for Druid nodes ☆27 Updated 8 years ago
- Splittable Gzip codec for Hadoop ☆70 Updated 3 weeks ago
- functionstest ☆33 Updated 8 years ago
- Secondary sort and streaming reduce for Apache Spark ☆78 Updated last year
- Example project to show how to use Spark to read and write Avro/Parquet files ☆50 Updated 11 years ago
- A library to expose more of Apache Spark's metrics system ☆146 Updated 5 years ago
- Few things we've met during our etl project based on spark ☆24 Updated 6 years ago
- Set of real time stream processing algorithms that can be used by big data streaming platform ☆72 Updated 4 years ago
- Approximate Nearest Neighbors in Spark ☆174 Updated 3 years ago
- Cascading on Apache Flink® ☆54 Updated last year
- Kite SDK Examples ☆99 Updated 3 years ago
- This is an introduction of Apache Spark DataFrames. ☆41 Updated 9 years ago
- Elasticsearch plugin for b-bit minhash algorithm ☆62 Updated 7 months ago
- Sample migration from Titan 0.5.4 to Titan 1.0.0 ☆17 Updated 9 years ago
- Test driven learning of Cascading. ☆38 Updated 5 years ago
- Text Classification Engine ☆36 Updated 5 years ago
- Spark RDD with Lucene's query and entity linkage capabilities ☆125 Updated 3 weeks ago
- Scriptable scheduler for periodical Hadoop workflows ☆22 Updated 7 years ago
- Notes about Spark Streaming in Apache Spark ☆58 Updated 7 years ago
- Apache Calcite Tutorial ☆33 Updated 8 years ago
- Querqy for Elasticsearch ☆45 Updated last week
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what… ☆96 Updated 5 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ. ☆32 Updated 2 years ago
- MLeap demo repository for use with MLeap blog posts ☆11 Updated 8 years ago
- This plugin provides a feature to change top N documents in a search result. ☆56 Updated last year
- Starter project for building MemSQL Streamliner Pipelines ☆32 Updated 7 years ago
- Stratosphere is now Apache Flink. ☆196 Updated last year
- flink-jpmml is a fresh-made library for dynamic real time machine learning predictions built on top of PMML standard models and Apache Fl… ☆96 Updated 5 years ago