dropbox / PyHive
Python interface to Hive and Presto. π
β1,679Updated 5 months ago
Alternatives and similar repositories for PyHive:
Users that are interested in PyHive are comparing it to the libraries listed below
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)β732Updated 2 months ago
- Python DB-API client for Prestoβ239Updated last year
- A developer-friendly Python library to interact with Apache HBaseβ609Updated 6 months ago
- API and command line interface for HDFSβ273Updated 4 months ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhereβ1,007Updated 2 years ago
- A Python connector for Druidβ510Updated 5 months ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Sparkβ1,350Updated last year
- β209Updated 8 years ago
- Read - Write JSON SerDe for Apache Hive.β736Updated last year
- Apache Parquet Javaβ2,696Updated this week
- A pure python HDFS clientβ855Updated 2 years ago
- Jupyter magics and kernels for working with remote Spark clustersβ1,339Updated last month
- Apache Kafka client for Python; high-level & low-level consumer/producer, with great performance.β1,118Updated 4 years ago
- Real-time Query for Hadoop; mirror of Apache Impalaβ34Updated 2 years ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orgaβ¦β2,229Updated last week
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfacesβ325Updated 4 years ago
- β517Updated 2 years ago
- β1,619Updated 2 weeks ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.β899Updated 2 months ago
- PySpark + Scikit-learn = Sparkit-learnβ1,152Updated 4 years ago
- REST job server for Apache Sparkβ2,835Updated 3 weeks ago
- β1,016Updated last week
- Data Lineage Tracking And Visualization Solutionβ609Updated this week
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.β553Updated 3 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availabilityβ232Updated 2 years ago
- Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyondβ927Updated this week
- Mirror of Apache Oozieβ722Updated this week
- JayDeBeApi module allows you to connect from Python code to databases using Java JDBC. It provides a Python DB-API v2.0 to that database.β369Updated 6 months ago
- python implementation of the parquet columnar file format.β805Updated 2 months ago
- A Python MapReduce and HDFS API for Hadoopβ237Updated last year