sequenceiq / docker-hadoop-build
Docker contaier to build Apache Hadoop
☆12Updated 9 years ago
Alternatives and similar repositories for docker-hadoop-build:
Users that are interested in docker-hadoop-build are comparing it to the libraries listed below
- A shim for using Cassandra as a backend for OpenTSDB. Not to be used as a general Cassandra client.☆7Updated 6 years ago
- Yarn on Docker - Managing Hadoop Yarn cluster with Docker Swarm.☆37Updated 3 years ago
- A native go client for HDFS☆29Updated 2 years ago
- Integration of Iceberg table management into Spark SQL☆11Updated 5 years ago
- install Cloudera's distribution of Hadoop including Cloudera Manager and Cloudera Search (Beta)☆31Updated 11 years ago
- A template-based cluster provisioning system☆61Updated 2 years ago
- Hbase cluster based on zookeeper/hadoop on kubernetes.☆40Updated 7 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- Ambari Metrics System Plugin for Grafana > v4.5.x☆24Updated 6 years ago
- Scripts to build a Docker image with Apache Impala with Kudu support (no HDFS needed)☆17Updated 4 years ago
- diqube is a fast, distributed, in-memory column-store which enables you to analyze large amounts of read-only data easily☆18Updated 2 years ago
- Flink Examples☆39Updated 8 years ago
- Read druid segments from hadoop☆10Updated 8 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Updated 5 years ago
- Presto K8S Operator☆9Updated 4 years ago
- Example using Grafana with Druid☆11Updated 10 years ago
- Ansible playbooks to help to deploy Apache Hadoop,Spark,Storm,Zookeeper,Elasticsearch,Azkaban,Flume,Hbase,Kafka,Kibana,Logstash☆10Updated 8 years ago
- Thoughts on things I find interesting.☆17Updated 3 months ago
- Docker image with Apache Beam + Flink☆32Updated 8 years ago
- Data sets and Vagrant script to provision a virtual machine for Apache Calcite development☆29Updated 2 years ago
- Cascading on Apache Flink®☆54Updated last year
- Cluster Partition Rebalancer For Kafka is a tool that runs in the background on Kafka brokers and lets them move partitions across broker…☆18Updated 7 years ago
- Docker Image for Kudu☆38Updated 6 years ago
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Updated 4 years ago
- Llama - Low Latency Application MAster☆34Updated 2 years ago
- An HDFS backed ContentsManager implementation for Jupyter☆12Updated 11 months ago
- Kylin running in a Docker cluster☆46Updated 8 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- Module for accessing OpenTSDB data in HBASE and creating a SparkRDD☆12Updated 10 years ago