frischHWC / datagenLinks
Datagenerator for Data Services
☆16Updated 3 weeks ago
Alternatives and similar repositories for datagen
Users that are interested in datagen are comparing it to the libraries listed below
Sorting:
- Copy Hive tables definitions to Compute Cluster, while still using Storage on original cluster☆12Updated 3 weeks ago
- One Click Script to Deploy CDP (CDP PvC & HDP & CDH)☆32Updated 3 weeks ago
- Kerberos and Hadoop: The Madness beyond the Gate☆280Updated 2 years ago
- Prerequisites checker for Cloudera Manager and CDP PVC Base installations☆58Updated last year
- Data Lineage Tracking And Visualization Solution☆642Updated last week
- Tutorial on how to setup Trino and Apache Ranger using docker☆41Updated last year
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆658Updated last month
- Cloudera Manager Extensibility Tools and Documentation.☆190Updated last year
- Edge2AI Workshop☆70Updated 4 months ago
- Port of TPC-DS dsdgen to Java☆51Updated last year
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Updated 2 years ago
- Serde for Cobol Layout to Hive table☆24Updated 6 years ago
- A collection of templates for use with Apache NiFi.☆280Updated 8 years ago
- Presto-Teradata connector☆16Updated 3 years ago
- Hadoop FSImage Analyzer (HFSA)☆62Updated this week
- Cloudera deployment automation with Ansible☆199Updated 4 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆588Updated last year
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆129Updated last month
- Mirror of Apache Knox☆205Updated 2 weeks ago
- Storage connector for Trino☆116Updated last week
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE☆295Updated 2 years ago
- ☆27Updated last year
- An Ansible collection for Cloudera Platform for cloud and Data Services☆20Updated 3 weeks ago
- Example to create lineage in Atlas with sqoop and spark☆14Updated 8 years ago
- Cloudera CDP SDK for Java☆15Updated 3 weeks ago
- Mirror of Apache Ranger☆15Updated last year
- A general purpose framework for automating Cloudera Products☆67Updated 7 months ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆282Updated 3 weeks ago
- Fork of Apache Ambari maintained by Clemlab Company☆51Updated last week
- A data generator source connector for Flink SQL based on data-faker.☆230Updated 2 years ago