frischHWC / datagenLinks
Datagenerator for Data Services
☆16Updated 9 months ago
Alternatives and similar repositories for datagen
Users that are interested in datagen are comparing it to the libraries listed below
Sorting:
- Copy Hive tables definitions to Compute Cluster, while still using Storage on original cluster☆12Updated this week
- One Click Script to Deploy CDP (CDP PvC & HDP & CDH)☆32Updated last month
- Prerequisites checker for Cloudera Manager and CDP PVC Base installations☆58Updated last year
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆655Updated 2 weeks ago
- Cloudera Manager Extensibility Tools and Documentation.☆190Updated last year
- Kerberos and Hadoop: The Madness beyond the Gate☆279Updated 2 years ago
- A collection of templates for use with Apache NiFi.☆280Updated 8 years ago
- Example to create lineage in Atlas with sqoop and spark☆14Updated 8 years ago
- Useful shell scripts for Hadoop/Linux system administrator☆57Updated 7 years ago
- Cloudera deployment automation with Ansible☆198Updated 4 years ago
- TPC-DS Kit for Impala☆171Updated last year
- Edge2AI Workshop☆70Updated 3 months ago
- Mirror of Apache Knox☆205Updated last week
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- Cloudera Manager API Client☆308Updated last year
- Schema Registry☆17Updated last year
- Serde for Cobol Layout to Hive table☆24Updated 6 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Updated 2 years ago
- Port of TPC-DS dsdgen to Java☆52Updated last year
- Build configuration-driven ETL pipelines on Apache Spark☆161Updated 2 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆284Updated last month
- Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version☆271Updated 11 months ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆127Updated 3 weeks ago
- Cloudera CDP SDK for Java☆14Updated this week
- ☆103Updated 5 years ago
- Data Lineage Tracking And Visualization Solution☆641Updated last week
- Hadoop FSImage Analyzer (HFSA)☆62Updated this week
- Ambari service for Apache Flink☆127Updated 4 years ago
- Remedy small files by combining them into larger ones.☆22Updated 6 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago