noirello / pyorc
Python module for Apache ORC file format
☆64Updated last month
Alternatives and similar repositories for pyorc:
Users that are interested in pyorc are comparing it to the libraries listed below
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆232Updated 2 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆136Updated 3 years ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆71Updated 3 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated last year
- Multiple node presto cluster on docker container☆124Updated 2 years ago
- Python DB-API client for Presto☆239Updated last year
- A tool and library for easily deploying applications on Apache YARN☆142Updated 10 months ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆109Updated last year
- ☆209Updated 8 years ago
- ☆20Updated last year
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- A testing framework for Trino☆26Updated 2 months ago
- Fast iterative local development and testing of Apache Airflow workflows☆195Updated last month
- Spark SQL index for Parquet tables☆134Updated 3 years ago
- Storage connector for Trino☆103Updated this week
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces☆325Updated 4 years ago
- Docker image for Apache Hive Metastore☆71Updated last year
- Convert JSON files to Parquet using PyArrow☆95Updated last year
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- ☆79Updated last year
- Python package to extend Airflow functionality with CWL1.1 support☆186Updated last year
- Python client for Hadoop® YARN API☆109Updated 2 years ago
- Python bindings for sqlparser-rs☆174Updated 3 months ago
- REST API for Apache Spark on K8S or YARN☆94Updated this week
- Dockerized setup for testing code on realistic hadoop clusters☆27Updated 4 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year