fabiog1901 / SingleNodeCDPCluster
☆27Updated last year
Alternatives and similar repositories for SingleNodeCDPCluster:
Users that are interested in SingleNodeCDPCluster are comparing it to the libraries listed below
- Edge2AI Workshop☆69Updated this week
- ☆29Updated this week
- A general purpose framework for automating Cloudera Products☆66Updated 2 months ago
- An opinionated auto-deployer for the Hortonworks Platform☆34Updated 4 years ago
- Materials for various Hadoop & Nifi related workshops☆19Updated 3 years ago
- HDF masterclass materials☆28Updated 9 years ago
- CSD for Apache Airflow☆20Updated 5 years ago
- Leveraging Hortonworks' HDP 3.1.0 and HDF 3.4.0 components, this tutorial guides the user through steps to stream data from a REST API in…☆20Updated 5 years ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster☆59Updated 6 years ago
- cloudera.cloud - an Ansible collection for Cloudera Data Platform (CDP) for Public and Private Cloud☆20Updated 2 months ago
- Copy Hive tables definitions to Compute Cluster, while still using Storage on original cluster☆11Updated this week
- Useful shell scripts for Hadoop/Linux system administrator☆57Updated 6 years ago
- Materials for various Hadoop & Nifi related workshops☆51Updated 6 years ago
- CDP examples and tutorials☆19Updated last year
- Rocksdb state storage implementation for Structured Streaming.☆17Updated 4 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- MapReduce performance testing using teragen and terasort☆18Updated 3 years ago
- Prerequisites checker for Cloudera Manager and CDP PVC Base installations☆58Updated last year
- ☆25Updated 8 years ago
- Sample processing code using Spark 2.1+ and Scala☆52Updated 4 years ago
- Datagenerator for Data Services☆16Updated 4 months ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Cloudera Director sample code☆61Updated 5 years ago
- An Ansible collection for lifecycle and management of Cloudera CDP Private Cloud resources on bare metal, IaaS, and PaaS.☆34Updated last week
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Memory / Configuration Calculator for Hive LLAP☆14Updated 4 years ago
- cloudera.exe -- an Ansible collection enabling runlevel management of CDP Public Cloud deployments as well as numerous utilities for depl…☆12Updated 7 months ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 3 years ago
- ☆16Updated 4 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year