miho120 / ambari-airflow-mpack
Ambari stack service for installing and managing Apache Airflow on HDP cluster
☆59Updated 6 years ago
Alternatives and similar repositories for ambari-airflow-mpack:
Users that are interested in ambari-airflow-mpack are comparing it to the libraries listed below
- CSD for Apache Airflow☆20Updated 5 years ago
- Collection of tools for bootstrapping Apache Ambari & deploying clusters☆83Updated 6 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆180Updated 2 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆159Updated 2 years ago
- Python client for Hadoop® YARN API☆109Updated 2 years ago
- Setup for running Trino with Hive Metastore on Kubernetes☆101Updated 2 years ago
- Workshops on how to setup security on Hadoop using HDP sandboxes☆100Updated 7 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆83Updated 5 years ago
- Cloudera deployment automation with Ansible☆197Updated 4 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Updated 3 years ago
- Demos around Ambari Views, Services, Blueprints☆63Updated 9 years ago
- Star Schema Benchmark using the Hive / Druid Integration☆30Updated 7 years ago
- A general purpose framework for automating Cloudera Products☆66Updated last month
- Spark Clickhouse Connector☆72Updated 4 years ago
- ☆102Updated 5 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | 项目已迁移至 Apa…☆176Updated 3 years ago
- Presto-Teradata connector☆16Updated 2 years ago
- ☆27Updated last year
- A re-implementation of Hadoop DistCP in Apache Spark☆47Updated last year
- ☆40Updated last year
- Useful shell scripts for Hadoop/Linux system administrator☆57Updated 6 years ago
- Hadoop FSImage Analyzer (HFSA)☆59Updated this week
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆120Updated 3 weeks ago
- Rocksdb state storage implementation for Structured Streaming.☆17Updated 4 years ago
- A library for querying Druid data sources with Apache Spark☆23Updated 4 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- Kerberos and Hadoop: The Madness beyond the Gate☆280Updated last year
- Ambari stack service for easily installing and managing Hue on HDP cluster☆107Updated 5 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆266Updated 2 years ago