gateway-experiments / hadoop-yarn-api-python-client
Python client for Hadoop® YARN API
☆109 Updated 2 years ago
Alternatives and similar repositories for hadoop-yarn-api-python-client
Users who are interested in hadoop-yarn-api-python-client are comparing it to the libraries listed below.
- ☆208 Updated 9 years ago
- Cloudera Manager API Client ☆308 Updated last year
- Spark metrics related custom classes and sinks (e.g. Prometheus) ☆183 Updated 3 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas ☆266 Updated 2 years ago
- ☆103 Updated 5 years ago
- Example project showing how to use Hive UDFs in Apache Spark ☆55 Updated 6 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments. ☆283 Updated last month
- TPC-DS Kit for Impala ☆171 Updated last year
- Plugin for Presto to allow addition of user functions easily ☆120 Updated 4 years ago
- Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation ☆23 Updated 9 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa… ☆178 Updated 3 years ago
- Scripts for generating Grafana dashboards for monitoring Spark jobs ☆240 Updated 10 years ago
- Mirror of Apache Bahir ☆335 Updated 2 years ago
- Cloudera Manager Extensibility Tools and Documentation. ☆190 Updated last year
- Remedy small files by combining them into larger ones. ☆194 Updated 3 years ago
- Python DB-API client for Presto ☆238 Updated last year
- ☆57 Updated 6 years ago
- Facebook's Hive UDFs ☆275 Updated 2 weeks ago
- Build configuration-driven ETL pipelines on Apache Spark ☆161 Updated 2 years ago
- API and command line interface for HDFS ☆274 Updated 11 months ago
- Lightweight Azkaban client ☆77 Updated 5 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses. ☆279 Updated 6 years ago
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol) ☆740 Updated last month
- Ambari stack service for easily installing and managing Hue on HDP cluster ☆107 Updated 6 years ago
- A collection of Hive UDFs ☆75 Updated 5 years ago
- A library for querying Binlog with Apache Spark Structured Streaming, for Spark SQL, DataFrames and [MLSQL](https://www.mlsql.tech). ☆153 Updated 2 years ago
- NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance. ☆117 Updated last year
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability ☆235 Updated 3 years ago
- The Internals of Spark Structured Streaming ☆419 Updated 2 years ago
- Mirror of Apache Atlas (Incubating) ☆95 Updated 2 years ago