paulvid / cdp-one-clickLinks
☆16Updated 4 years ago
Alternatives and similar repositories for cdp-one-click
Users that are interested in cdp-one-click are comparing it to the libraries listed below
Sorting:
- Hadoop utility jar for troubleshooting integration with cloud object stores☆37Updated this week
- ☆12Updated 2 years ago
- ☆24Updated 2 years ago
- Cloudera Director sample code☆61Updated 6 years ago
- Utilities to help HBase as a service in HDInsight Azure☆14Updated 2 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆91Updated last year
- A general purpose framework for automating Cloudera Products☆69Updated 10 months ago
- ☆27Updated 2 years ago
- TPCDS benchmark for various engines☆18Updated 3 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆37Updated last year
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆62Updated 2 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 4 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- Databricks Migration Tools☆43Updated 4 years ago
- Terraform script for launching multiple EMR clusters for training purposes.☆16Updated 3 months ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated 2 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 7 months ago
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks☆18Updated 6 years ago
- Presentations and other resources.☆36Updated 5 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated 3 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 3 years ago
- Materials for various Hadoop & Nifi related workshops☆51Updated 6 years ago
- Curated list of resources about Apache Airflow☆19Updated 4 years ago
- Edge2AI Workshop☆70Updated 7 months ago
- Tools to deploy Hadoop on EMC Isilon☆17Updated 9 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆80Updated last week
- Big Data Demystified meetup and blog examples☆31Updated last year
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Updated 5 years ago
- ☆100Updated 2 years ago
- Application to securely map users on a multi tenant Amazon EMR cluster to different IAM Roles and then assume the mapped Role.☆24Updated 2 years ago