PythonicNinja / pydrillLinks
Python Driver for Apache Drill.
☆61Updated 2 years ago
Alternatives and similar repositories for pydrill
Users that are interested in pydrill are comparing it to the libraries listed below
Sorting:
- Apache Drill Dialect for SQL Alchemy☆54Updated 7 months ago
- python implementation of the parquet columnar file format.☆358Updated 4 years ago
- Official repository for pygrametl - ETL programming in Python☆299Updated 4 months ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 4 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆270Updated last year
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- JayDeBeApi module allows you to connect from Python code to databases using Java JDBC. It provides a Python DB-API v2.0 to that database.☆383Updated last year
- Convert JSON files to Parquet using PyArrow☆98Updated 2 years ago
- OlaPy, an experimental OLAP engine based on Pandas☆109Updated 2 years ago
- Serializes data into a JSON format using AVRO schema.☆138Updated 4 years ago
- REST-like API exposing Airflow data and operations☆61Updated 7 years ago
- Python DB-API client for Presto☆240Updated 2 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Updated 7 years ago
- Example for an airflow plugin☆49Updated 9 years ago
- A Python library for working with Table Schema.☆264Updated last year
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆652Updated this week
- Helpers & syntactic sugar for PySpark.☆62Updated 2 months ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 4 years ago
- A Postgres-backed ContentsManager implementation for Jupyter☆150Updated 2 years ago
- A SQLAlchemy dialect for MonetDB☆40Updated last week
- An extendable Docker image for Airbnb's Superset platform, previously known as Caravel.☆114Updated 3 years ago
- Lightweight configuration and access to multiple databases in a single project☆38Updated 2 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆72Updated 7 years ago
- mito ETL tool☆163Updated 4 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 3 years ago
- A Python connector for Druid☆519Updated 4 months ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆259Updated 2 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆204Updated last month
- Airflow plugin to transfer arbitrary files between operators☆78Updated 7 years ago
- PyAthenaJDBC is an Amazon Athena JDBC driver wrapper for the Python DB API 2.0 (PEP 249).☆94Updated 2 years ago