elek / bigdata-docker
Docker images for Open Source bigdata/hadoop projects
☆34 · Updated 6 years ago
Alternatives and similar repositories for bigdata-docker
Users who are interested in bigdata-docker are comparing it to the libraries listed below.
- Docker image for main Apache Hadoop components (YARN/HDFS) ☆56 · Updated 2 years ago
- Dockerized HDP Cluster ☆84 · Updated 7 years ago
- Automated (Ansible) installation of HDP via Ambari Blueprint ☆16 · Updated 8 years ago
- SQL on HBase with Apache Phoenix in Docker ☆29 · Updated 9 years ago
- Sample Spark Streaming application for secure consumption from Kafka ☆33 · Updated 7 years ago
- Multiple node cluster on Docker for self development. ☆93 · Updated 6 years ago
- ☆20 · Updated 3 years ago
- HBase cluster based on ZooKeeper/Hadoop on Kubernetes. ☆40 · Updated 7 years ago
- Ambari Metrics System Plugin for Grafana > v4.5.x ☆24 · Updated 6 years ago
- Visualize your HDFS cluster usage ☆229 · Updated 4 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra ☆62 · Updated 5 years ago
- Base Hadoop/Spark/bigdata image with advanced config loading scripts. ☆11 · Updated 4 years ago
- ☆105 · Updated 5 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 · Updated 2 years ago
- Demos around Ambari Views, Services, Blueprints ☆63 · Updated 9 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase ☆51 · Updated 10 years ago
- An integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes ☆83 · Updated 5 years ago
- Documentation placeholder and utilities for all the other containers. ☆30 · Updated 5 years ago
- Based off the design of SparkOnHBase. This repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu. ☆50 · Updated 9 years ago
- ☆41 · Updated 9 years ago
- ☆70 · Updated 2 years ago
- Running YARN on Kubernetes with PetSet controller. ☆166 · Updated 7 years ago
- Presto's Elasticsearch connector ☆11 · Updated 8 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall ☆98 · Updated 4 years ago
- Ansible playbook to deploy Cloudera Hadoop components to the cluster ☆52 · Updated 6 years ago
- Apache Spark ETL Utilities ☆40 · Updated 7 months ago
- Install Cloudera's distribution of Hadoop including Cloudera Manager and Cloudera Search (Beta) ☆31 · Updated 11 years ago
- Ambari service to deploy/manage Hortonworks IoT demo ☆22 · Updated 8 years ago
- High performance HBase / Spark SQL engine ☆28 · Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆88 · Updated last year