mapr / mapr-docker-multiLinks
☆15Updated 9 years ago
Alternatives and similar repositories for mapr-docker-multi
Users that are interested in mapr-docker-multi are comparing it to the libraries listed below
Sorting:
- Scripts to validate that a cluster is ready for MapR Data Platform installation☆85Updated 5 years ago
- ☆34Updated 6 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Gallery of Apache Zeppelin notebooks☆216Updated 6 years ago
- Docker build for Zeppelin, a web-based Spark notebook☆221Updated 5 years ago
- ☆24Updated 10 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 5 years ago
- This repository hold the Amazon Elastic MapReduce sample bootstrap actions☆613Updated 2 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 5 years ago
- Ansible playbooks for deploying Hortonworks Data Platform☆128Updated 4 years ago
- Ansible playbooks for Apache Spark on kube☆27Updated 8 years ago
- Cloudera Director sample code☆61Updated 5 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆92Updated last year
- Using JPMML Evaluator to validate the PMML models exported from Spark☆19Updated 8 years ago
- Scripts used to setup a Spark cluster on EC2☆390Updated 7 years ago
- Ansible playbook that installs a Hadoop cluster, with HBase, Hive, Presto for analytics, and Ganglia, Smokeping, Fluentd, Elasticsearch a…☆418Updated 8 years ago
- Ansible playbooks for deploying Hortonworks Data Platform and DataFlow using Ambari Blueprints☆248Updated 4 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 2 years ago
- Docker image for general apache spark client☆117Updated 8 years ago
- Workshops on how to setup security on Hadoop using HDP sandboxes☆100Updated 7 years ago
- PyAthenaJDBC is an Amazon Athena JDBC driver wrapper for the Python DB API 2.0 (PEP 249).☆95Updated last year
- Scripts and instructions to facilitate running Deep Learning Tasks on Amazon EMR☆63Updated last year
- Scripts for generating Grafana dashboards for monitoring Spark jobs☆241Updated 10 years ago
- Materials for various Hadoop & Nifi related workshops☆19Updated 4 years ago
- ☆70Updated 2 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 11 years ago
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku…☆612Updated 5 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- An extendable Docker image for Airbnb's Superset platform, previously known as Caravel.☆114Updated 3 years ago