cloudera / impyla
Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)
☆737Updated 3 months ago
Alternatives and similar repositories for impyla
Users who are interested in impyla are comparing it to the libraries listed below.
- Python interface to Hive and Presto. 🐝☆1,684Updated 10 months ago
- ☆208Updated 9 years ago
- Python DB-API client for Presto☆238Updated last year
- A developer-friendly Python library to interact with Apache HBase☆609Updated 10 months ago
- API and command line interface for HDFS☆273Updated 8 months ago
- A Python connector for Druid☆517Updated this week
- A pure Python HDFS client☆857Updated 3 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆235Updated 2 years ago
- ☆524Updated 3 years ago
- Mirror of Apache Hivemall (incubating)☆312Updated 2 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Updated 2 years ago
- Python client for Hadoop® YARN API☆109Updated 2 years ago
- A Python MapReduce and HDFS API for Hadoop☆239Updated 4 months ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,354Updated 3 weeks ago
- Mirror of Apache Toree (Incubating)☆745Updated last month
- A collection of examples using Flink's new Python API☆246Updated 2 months ago
- Read/Write JSON SerDe for Apache Hive.☆737Updated last year
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆913Updated last week
- JayDeBeApi module allows you to connect from Python code to databases using Java JDBC. It provides a Python DB-API v2.0 to that database.☆375Updated 11 months ago
- Spark package for checking data quality☆221Updated 5 years ago
- The Internals of Spark Structured Streaming☆419Updated 2 years ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,360Updated last year
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆280Updated 6 years ago
- Data Lineage Tracking And Visualization Solution☆630Updated last week
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆270Updated 9 months ago
- A plugin for Apache Airflow that exposes REST endpoints for the command-line interfaces☆326Updated 4 years ago
- Facebook's Hive UDFs☆271Updated 3 months ago
- Qubole Sparklens tool for performance tuning Apache Spark☆579Updated 11 months ago
- A Spark Atlas connector to track data lineage in Apache Atlas☆267Updated 2 years ago
- Some useful custom Hive UDFs, especially array, JSON, math, and string functions.☆225Updated 10 months ago