paulvid / cdp-one-clickLinks
☆16Updated 4 years ago
Alternatives and similar repositories for cdp-one-click
Users that are interested in cdp-one-click are comparing it to the libraries listed below
Sorting:
- ☆12Updated 2 years ago
- Hadoop utility jar for troubleshooting integration with cloud object stores☆36Updated 2 weeks ago
- Cloudera Director sample code☆61Updated 5 years ago
- A general purpose framework for automating Cloudera Products☆67Updated 7 months ago
- ☆27Updated last year
- Edge2AI Workshop☆70Updated 4 months ago
- HDF masterclass materials☆28Updated 9 years ago
- Materials for various Hadoop & Nifi related workshops☆51Updated 6 years ago
- ☆24Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆90Updated last year
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 3 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated 2 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- An opinionated auto-deployer for the Hortonworks Platform☆34Updated 4 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 3 years ago
- Events about the open source data stack☆13Updated 3 years ago
- Presentations and other resources.☆36Updated 5 years ago
- Spark Scala docker container sample for AWS testing - EKS & S3☆24Updated 7 years ago
- Application to securely map users on a multi tenant Amazon EMR cluster to different IAM Roles and then assume the mapped Role.☆22Updated last year
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 8 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated last year
- Utilities to help HBase as a service in HDInsight Azure☆14Updated 2 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- TPCDS benchmark for various engines☆18Updated 3 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 3 months ago
- Terraform script for launching multiple EMR clusters for training purposes.☆16Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆76Updated last week
- CDP examples and tutorials☆19Updated 4 months ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- An Ansible collection of utilities and other resources for Cloudera Platform deployments☆12Updated this week