bigdatafoundation / docker-hadoop
Dockerfile for running Hadoop on Ubuntu
☆91Updated last year
Alternatives and similar repositories for docker-hadoop:
Users that are interested in docker-hadoop are comparing it to the libraries listed below
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆242Updated 10 years ago
- Ansible playbook that installs a Hadoop cluster, with HBase, Hive, Presto for analytics, and Ganglia, Smokeping, Fluentd, Elasticsearch a…☆418Updated 8 years ago
- ☆70Updated 2 years ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆155Updated 10 months ago
- Mirror of Apache Myriad (Incubating)☆154Updated last year
- Example programs and scripts for accessing parquet files☆30Updated 7 years ago
- Hadoop on Mesos☆175Updated 2 years ago
- Docker Cloudera Quick Start Image☆92Updated 7 years ago
- Ambari service for Apache Zeppelin notebook☆71Updated 7 years ago
- Vagrant project to spin up a cluster of 4 32-bit CentOS6.5 Linux virtual machines with Hadoop v2.6.0 and Spark v1.1.1☆126Updated 9 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- Kite SDK Examples☆99Updated 3 years ago
- Support Highcharts in Apache Zeppelin☆81Updated 7 years ago
- Examples of Spark 2.0☆211Updated 3 years ago
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- Docker image with Ambari☆290Updated 7 years ago
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆94Updated 4 years ago
- A virtual Hadoop cluster running CDH5☆103Updated 9 years ago
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a …☆328Updated 3 years ago
- Ansible playbooks to construct distributed computing environments☆62Updated 3 years ago
- Remedy small files by combining them into larger ones.☆193Updated 2 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆72Updated 8 years ago
- Ready-to-use, manually tuned Cloudera Hadoop Distribution 5 provisioned cluster☆68Updated 9 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆283Updated 6 years ago
- Visualize your HDFS cluster usage☆229Updated 4 years ago
- Apache Spark and Apache Kafka integration example☆124Updated 7 years ago
- Storm on Mesos!☆138Updated 3 years ago
- Simple Spark Application☆76Updated last year
- Examples for learning spark☆332Updated 9 years ago