teamclairvoyant / hadoop-deployment-bashLinks
Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.
☆34Updated last year
Alternatives and similar repositories for hadoop-deployment-bash
Users that are interested in hadoop-deployment-bash are comparing it to the libraries listed below
Sorting:
- Cloudera Director sample code☆61Updated 5 years ago
- Cloudera deployment automation with Ansible☆198Updated 4 years ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆59Updated 6 years ago
- ansible playbook to deploy cloudera hadoop components to the cluster☆52Updated 6 years ago
- spark on kubernetes☆104Updated 2 years ago
- CSD for Apache Airflow☆20Updated 5 years ago
- Ansible playbooks for deploying Hortonworks Data Platform☆128Updated 4 years ago
- A general purpose framework for automating Cloudera Products☆67Updated 5 months ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Workshops on how to setup security on Hadoop using HDP sandboxes☆100Updated 7 years ago
- Edge2AI Workshop☆70Updated last month
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 5 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆174Updated 2 months ago
- Ansible roles to install an Spark Standalone cluster (HDFS/Spark/Jupyter Notebook) or Ambari based Spark cluster☆61Updated last year
- Delta Lake Examples☆12Updated 5 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆183Updated 2 years ago
- HDF masterclass materials☆28Updated 9 years ago
- Ansible playbooks for deploying Hortonworks Data Platform and DataFlow using Ambari Blueprints☆249Updated 4 years ago
- Useful shell scripts for Hadoop/Linux system administrator☆56Updated 6 years ago
- Quickly deploy Hadoop with the help of Ansible and Apache Ambari☆38Updated 10 years ago
- Automatically deploy and configure Template on Nifi☆56Updated last year
- Multi-stage, config driven, SQL based ETL framework using PySpark☆26Updated 5 years ago
- Kerberos and Hadoop: The Madness beyond the Gate☆280Updated 2 years ago
- An example Apache Beam project.☆111Updated 8 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆183Updated 3 years ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆30Updated 10 years ago
- ☆27Updated 4 years ago
- Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog☆35Updated last year
- The Internals of Spark on Kubernetes☆71Updated 3 years ago