flokkr / docker-baseimageLinks
Base hadoop/spark/bigdata image with advanced config loading scripts.
☆11Updated 4 years ago
Alternatives and similar repositories for docker-baseimage
Users that are interested in docker-baseimage are comparing it to the libraries listed below
Sorting:
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆56Updated 2 years ago
- A Docker image for Livy, the REST Spark Server☆15Updated 9 years ago
- A small Spark "cluster", running in standalone mode. Suitable for testing and development.☆23Updated last year
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 5 years ago
- Support Highcharts in Apache Zeppelin☆81Updated 7 years ago
- A Spark metrics sink that pushes to InfluxDb☆51Updated 4 years ago
- Hadoop MapReduce tool to convert Avro data files to Parquet format.☆34Updated 12 years ago
- An Ansible role for installing Apache Spark.☆58Updated 7 years ago
- Spark example code demonstrating RDD, DataFrame and DataSet APIs.☆37Updated 9 years ago
- Docker images for Open Source bigdata/hadoop projects☆34Updated 6 years ago
- Code to index Hive tables to Solr and Solr indexes to Hive☆48Updated 6 years ago
- functionstest☆33Updated 8 years ago
- Docker Image for Kudu☆38Updated 6 years ago
- Set up a Hadoop and/or Spark cluster running within Docker containers on a single physical machine☆77Updated 4 years ago
- Docker images used internally by various Teradata projects for automation, testing, etc☆40Updated 7 years ago
- Docker Cloudera Quick Start Image☆93Updated 7 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- Docker image for Apache Spark☆76Updated 5 years ago
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆109Updated 7 years ago
- Docker image for apache zeppelin☆38Updated 8 years ago
- Scripts for parsing / making sense of yarn logs☆52Updated 8 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 5 years ago
- ☆70Updated 2 years ago
- Use Kubernetes to autoscale your spark clusters.☆10Updated 6 years ago
- Python client bindings for the Apache Ambari REST API☆44Updated 3 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆62Updated 5 years ago
- Vagrant project to spin up a cluster virtual machines with Hadoop v2.4.1 and Spark v1.0.1☆83Updated 9 years ago
- HDFS / Spark / Mesos / Elasticsearch / Kibana / Zeppelin BigDataLab with Ansible☆31Updated 8 years ago
- A tutorial on Apache Spark Unit Testing☆37Updated 9 years ago