InMobi / docker-hive
Docker image for Apache Hive running on Tez
☆7Updated 10 years ago
Alternatives and similar repositories for docker-hive
Users that are interested in docker-hive are comparing it to the libraries listed below
Sorting:
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 9 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Cascading on Apache Flink®☆54Updated last year
- Common components used across the datamountaineer kafka connect connectors☆21Updated 4 years ago
- Example Apache Flink cluster on Kubernetes☆22Updated 6 years ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 7 years ago
- Use SQL to transform your avro schema/records☆28Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- A template-based cluster provisioning system☆61Updated 2 years ago
- Paper: A Zero-rename committer for object stores☆20Updated 3 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Updated 5 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- ARCHIVED: Run Debezium/KafkaConnect CDC components in Kubernetes☆24Updated 6 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- Example project which simulates an interesting analytics use case using MemSQL Pipelines.☆14Updated 8 years ago
- Tools to deploy Hadoop on EMC Isilon☆17Updated 8 years ago
- phData Pulse application log aggregation and monitoring☆13Updated 5 years ago
- Machine Learning Processors for NiFi☆10Updated 7 years ago
- A simple Twitter-Streaming Application for Apache Flink☆21Updated 9 years ago
- NiFi processors for Apache Pulsar☆10Updated 3 years ago
- Sandbox for Apache nifi☆24Updated 3 years ago
- A scalable, distributed Time Series Database.☆28Updated 10 years ago
- Java and Scala client libraries for Concord☆13Updated 8 years ago
- Apache Hadoop HDFS Data Node Scheduler☆13Updated 8 years ago