devinstevenson / pure-transport
A thrift transport for PyHive using Pure SASL
☆17Updated 5 years ago
Alternatives and similar repositories for pure-transport:
Users that are interested in pure-transport are comparing it to the libraries listed below
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- Python PMML scoring library for PySpark as SparkML Transformer☆22Updated 4 months ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆268Updated 7 months ago
- Python module for Apache ORC file format☆64Updated 2 months ago
- Deploy dask on YARN clusters☆69Updated 8 months ago
- API and command line interface for HDFS☆272Updated 7 months ago
- A collection of examples using flinks new python API☆244Updated last week
- Function dependencies resolution and execution☆70Updated 4 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆136Updated 4 years ago
- Python Driver for Apache Drill.☆59Updated 2 years ago
- Asynchronous actions for PySpark☆47Updated 3 years ago
- OlaPy, an experimental OLAP engine based on Pandas☆107Updated 2 years ago
- A tool and library for easily deploying applications on Apache YARN☆143Updated last year
- Data analysis and reporting tool for quick access to custom charts and tables in Jupyter Notebooks and in the shell.☆120Updated last year
- PMML scoring library for Spark as SparkML Transformer☆21Updated 6 months ago
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Monitor Apache Spark from Jupyter Notebook☆172Updated 2 years ago
- A xlsx and html rendering library for rendering data available in Pandas DataFrames.☆25Updated 11 months ago
- SQLFlow client library for Python☆29Updated 2 years ago
- Dockerized setup for testing code on realistic hadoop clusters☆27Updated 4 years ago
- DDL parase and Convert to BigQuery JSON schema and DDL statements☆88Updated last year
- Phoenix database adapter for Python (migrated to the Apache Phoenix repo)☆26Updated 3 years ago
- Optional extensions for petl based on third party libraries.☆44Updated 9 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆117Updated 2 years ago
- A high-performance Python Kafka client. Efficiently from Kafka to Pandas and back.☆40Updated 6 years ago
- Vinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.☆65Updated 3 years ago
- Python DB-API client for Presto☆238Updated last year
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆233Updated 2 years ago