miho120 / ambari-airflow-mpack
Ambari stack service for installing and managing Apache Airflow on an HDP cluster
☆59 Updated 6 years ago
Alternatives and similar repositories for ambari-airflow-mpack:
Users who are interested in ambari-airflow-mpack are comparing it to the repositories listed below.
- Spark metrics related custom classes and sinks (e.g. Prometheus) ☆180 Updated 2 years ago
- CSD for Apache Airflow ☆20 Updated 5 years ago
- ACID Data Source for Apache Spark based on Hive ACID ☆97 Updated 3 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆88 Updated last year
- Collection of tools for bootstrapping Apache Ambari & deploying clusters ☆83 Updated 5 years ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them ☆135 Updated last year
- Build configuration-driven ETL pipelines on Apache Spark ☆159 Updated 2 years ago
- Spark connector for SFTP ☆100 Updated 2 years ago
- Setup for running Trino with Hive Metastore on Kubernetes ☆100 Updated 2 years ago
- ☆102 Updated 5 years ago
- Demos around Ambari Views, Services, Blueprints ☆63 Updated 9 years ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆120 Updated this week
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive ☆185 Updated 2 years ago
- Cloudera deployment automation with Ansible ☆198 Updated 4 years ago
- An opinionated auto-deployer for the Hortonworks Platform ☆34 Updated 4 years ago
- Plugin for Presto to allow addition of user functions easily ☆117 Updated 4 years ago
- Star Schema Benchmark using the Hive / Druid Integration ☆30 Updated 7 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different. ☆28 Updated 7 years ago
- Edge2AI Workshop ☆69 Updated 2 months ago
- Hadoop FSImage Analyzer (HFSA) ☆59 Updated last week
- A re-implementation of Hadoop DistCP in Apache Spark ☆47 Updated last year
- ☆40 Updated last year
- ☆27 Updated 2 months ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes ☆82 Updated 5 years ago
- Spark on Kubernetes infrastructure Docker images repo ☆37 Updated 2 years ago
- Ambari stack service for easily installing and managing Hue on HDP cluster ☆107 Updated 5 years ago
- Ambari service for Presto ☆44 Updated 2 months ago
- The Internals of Spark on Kubernetes ☆71 Updated 2 years ago
- ☆28 Updated last year
- A Spark Atlas connector to track data lineage in Apache Atlas ☆267 Updated 2 years ago