cloudera / impyla
Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)
☆731 Updated last week
Alternatives and similar repositories for impyla:
Users interested in impyla are comparing it to the libraries listed below.
- Python interface to Hive and Presto. ☆1,678 Updated 6 months ago
- ☆209 Updated 8 years ago
- A developer-friendly Python library to interact with Apache HBase ☆607 Updated 7 months ago
- Python DB-API client for Presto ☆238 Updated last year
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,006 Updated 2 years ago
- API and command line interface for HDFS ☆272 Updated 5 months ago
- A pure python HDFS client ☆856 Updated 2 years ago
- Jupyter magics and kernels for working with remote Spark clusters ☆1,342 Updated this week
- Python client for Hadoop® YARN API ☆109 Updated 2 years ago
- Mirror of Apache Hivemall (incubating) ☆311 Updated 2 years ago
- A Python connector for Druid ☆514 Updated 6 months ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability ☆233 Updated 2 years ago
- ☆519 Updated 3 years ago
- Data Lineage Tracking And Visualization Solution ☆613 Updated this week
- A connector for Spark that allows reading and writing to/from Redis cluster ☆945 Updated 4 months ago
- Mirror of Apache Toree (Incubating) ☆741 Updated last week
- Read - Write JSON SerDe for Apache Hive. ☆735 Updated last year
- A collection of examples using Flink's new Python API ☆244 Updated 6 years ago
- A Python MapReduce and HDFS API for Hadoop ☆238 Updated 3 weeks ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces. ☆268 Updated 6 months ago
- Qubole Sparklens tool for performance tuning Apache Spark ☆571 Updated 8 months ago
- Apache Kylin Python Client Library ☆63 Updated last year
- Spark package for checking data quality ☆221 Updated 5 years ago
- Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce. ☆167 Updated last year
- The Internals of Spark Structured Streaming ☆418 Updated 2 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit… ☆283 Updated 6 years ago
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces ☆325 Updated 4 years ago
- A Spark Atlas connector to track data lineage in Apache Atlas ☆267 Updated 2 years ago
- Examples for High Performance Spark ☆506 Updated 4 months ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses. ☆282 Updated 6 years ago