noirello / pyorc
Python module for Apache ORC file format
☆64Updated last week
Related projects:
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆232Updated 2 years ago
- A plugin for Apache Airflow that allows you to manage the users that can log in☆14Updated 4 years ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 2 years ago
- Python DB-API client for Presto☆239Updated 9 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- A client for connecting to and running DDLs on Hive Metastore.☆51Updated 6 months ago
- A tool and library for easily deploying applications on Apache YARN☆142Updated 6 months ago
- Apache Drill dialect for SQLAlchemy☆53Updated 2 months ago
- Simple project to expose a catalog over REST using a Java catalog backend☆103Updated this week
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)☆179Updated this week
- A plugin for Apache Airflow that exposes REST endpoints for the command-line interfaces☆325Updated 3 years ago
- DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time.☆42Updated last week
- Python client for Hadoop® YARN API☆109Updated last year
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆75Updated 5 years ago
- Multi-node Presto cluster on Docker containers☆120Updated 2 years ago
- Python bindings for sqlparser-rs☆152Updated last week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆296Updated 8 months ago
- Docker image for Apache Hive Metastore☆72Updated last year
- Convert JSON files to Parquet using PyArrow☆94Updated 8 months ago
- API and command line interface for HDFS☆268Updated 3 months ago
- Spark ClickHouse Connector built on the DataSourceV2 API☆181Updated this week
- Python client for Trino☆323Updated 2 weeks ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated 10 months ago
- Storage connector for Trino☆90Updated 3 weeks ago
- A library for Spark DataFrame using MinIO Select API☆96Updated 4 years ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆108Updated 7 months ago