frischHWC / datagen
Data generator for Data Services
☆16 Updated 4 months ago
Alternatives and similar repositories for datagen
Users who are interested in datagen are comparing it to the repositories listed below
- One Click Script to Deploy CDP (CDP PvC & HDP & CDH) ☆32 Updated 4 months ago
- Prerequisites checker for Cloudera Manager and CDP PVC Base installations ☆58 Updated 2 years ago
- Kerberos and Hadoop: The Madness beyond the Gate ☆282 Updated 2 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments. ☆285 Updated 2 months ago
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and… ☆667 Updated last week
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE ☆297 Updated 3 years ago
- A collection of templates for use with Apache NiFi. ☆278 Updated 9 years ago
- Serde for Cobol Layout to Hive table ☆24 Updated 6 years ago
- Mirror of Apache Knox ☆211 Updated last week
- Cloudera deployment automation with Ansible ☆200 Updated 5 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa… ☆182 Updated 3 years ago
- A simplified, lightweight ETL Framework based on Apache Spark ☆589 Updated 2 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters ☆83 Updated 6 years ago
- Useful shell scripts for Hadoop/Linux system administrator ☆57 Updated 7 years ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆133 Updated last month
- Spline agent for Apache Spark ☆201 Updated 2 weeks ago
- ☆103 Updated 5 years ago
- Cloudera Manager Extensibility Tools and Documentation. ☆193 Updated 2 years ago
- Build configuration-driven ETL pipelines on Apache Spark ☆161 Updated 3 years ago
- Qubole Sparklens tool for performance tuning Apache Spark ☆587 Updated last year
- A Spark Atlas connector to track data lineage in Apache Atlas ☆265 Updated 3 years ago
- JUnit integration for testing the Apache Hive Metastore and HiveServer2 Thrift APIs ☆26 Updated 6 months ago
- ☆27 Updated 2 years ago
- Ambari stack service for installing and managing Apache Airflow on HDP cluster ☆58 Updated 7 years ago
- Ambari service for Apache Flink ☆126 Updated 4 years ago
- Schema Registry ☆17 Updated last year
- Apache NiFi example flows ☆210 Updated 6 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark. ☆136 Updated 2 years ago
- Example to create lineage in Atlas with sqoop and spark ☆14 Updated 8 years ago
- Data Lineage Tracking And Visualization Solution ☆653 Updated this week