kgyrtkirk / hive-dev-box
☆23 Updated last year
Alternatives and similar repositories for hive-dev-box
Users who are interested in hive-dev-box are comparing it to the repositories listed below.
- Kerberos and Hadoop: The Madness beyond the Gate ☆280 Updated 2 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments. ☆285 Updated last month
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆941 Updated this week
- Qubole Sparklens tool for performance tuning Apache Spark ☆586 Updated last year
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and… ☆666 Updated 2 months ago
- hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE ☆296 Updated 3 years ago
- ☆393 Updated last year
- A load balancer / proxy / gateway for prestodb ☆358 Updated last year
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa… ☆808 Updated this week
- Cloudera Manager Extensibility Tools and Documentation. ☆193 Updated 2 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa… ☆182 Updated 3 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas ☆265 Updated 3 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,369 Updated 2 years ago
- Testing Sandbox for Hadoop Ecosystem Components ☆43 Updated last month
- A slightly moist lipstick-on-pig clone for Apache Hive ☆23 Updated 2 years ago
- Prerequisites checker for Cloudera Manager and CDP PVC Base installations ☆58 Updated 2 years ago
- TPC-DS Kit for Impala ☆171 Updated last year
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap… ☆303 Updated 2 months ago
- Data Lineage Tracking And Visualization Solution ☆653 Updated this week
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆131 Updated last week
- Spark metrics related custom classes and sinks (e.g. Prometheus) ☆184 Updated 3 years ago
- Profiler for large-scale distributed Java applications (Spark, Scalding, MapReduce, Hive,...) on YARN. ☆128 Updated 7 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆877 Updated last week
- Tutorial on how to set up Trino and Apache Ranger using Docker ☆41 Updated last year
- The Internals of Spark SQL ☆483 Updated last month
- Spline agent for Apache Spark ☆201 Updated last month
- Remote shuffle service for Apache Spark to store shuffle data on remote servers. ☆336 Updated 2 years ago
- Use the TPC-DS benchmark to test Spark SQL performance ☆183 Updated 5 years ago
- Apache Iceberg Documentation Site ☆42 Updated last year
- Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data ☆93 Updated 7 years ago