dropbox / PyHive
Python interface to Hive and Presto. 🐍
☆1,678 · Updated 6 months ago
Alternatives and similar repositories for PyHive:
Users who are interested in PyHive are comparing it to the libraries listed below.
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol) ☆731 · Updated last week
- Python DB-API client for Presto ☆238 · Updated last year
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,006 · Updated 2 years ago
- Jupyter magics and kernels for working with remote Spark clusters ☆1,342 · Updated this week
- A developer-friendly Python library to interact with Apache HBase ☆607 · Updated 7 months ago
- ☆209 · Updated 8 years ago
- A plugin for Apache Airflow that exposes REST endpoints for the command line interfaces ☆325 · Updated 4 years ago
- Mirror of Apache Griffin ☆1,150 · Updated last month
- A pure Python HDFS client ☆856 · Updated 2 years ago
- A Python connector for Druid ☆514 · Updated 6 months ago
- A connector for Spark that allows reading and writing to/from Redis cluster ☆945 · Updated 4 months ago
- Real-time Query for Hadoop; mirror of Apache Impala ☆34 · Updated 2 years ago
- ETL best practices with Airflow, with examples ☆1,318 · Updated 5 months ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆904 · Updated 3 months ago
- Apache Kafka client for Python; high-level & low-level consumer/producer, with great performance. ☆1,117 · Updated 4 years ago
- Data Lineage Tracking And Visualization Solution ☆613 · Updated this week
- API and command line interface for HDFS ☆272 · Updated 5 months ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,355 · Updated last year
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,236 · Updated this week
- PySpark + Scikit-learn = Sparkit-learn ☆1,154 · Updated 4 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability ☆233 · Updated 2 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink. ☆553 · Updated 3 years ago
- ☆1,625 · Updated this week
- Read - Write JSON SerDe for Apache Hive. ☆735 · Updated last year
- Mirror of Apache Oozie ☆723 · Updated last month
- Apache Tez ☆490 · Updated this week
- Apache Parquet Format ☆1,892 · Updated this week
- Apache Parquet Java ☆2,730 · Updated this week
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs ☆1,024 · Updated this week
- Dynamically generate Apache Airflow DAGs from YAML configuration files ☆1,250 · Updated 3 weeks ago