internetarchive / snakebite-py3Links
Pure python HDFS client: python3.x version
☆23Updated 2 months ago
Alternatives and similar repositories for snakebite-py3
Users that are interested in snakebite-py3 are comparing it to the libraries listed below
Sorting:
- Apache (Py)Spark type annotations (stub files).☆117Updated 3 years ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 3 years ago
- Python Driver for Apache Drill.☆60Updated 2 years ago
- API and command line interface for HDFS☆274Updated 11 months ago
- Helpers & syntactic sugar for PySpark.☆62Updated 2 years ago
- A tool and library for easily deploying applications on Apache YARN☆144Updated last year
- Monitor Apache Spark from Jupyter Notebook☆172Updated 3 years ago
- python implementation of the parquet columnar file format.☆354Updated 3 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 4 years ago
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆55Updated last month
- Python DB-API client for Presto☆238Updated last year
- Deploy dask on YARN clusters☆69Updated last year
- Prometheus Exporter for Airflow☆161Updated last year
- Asynchronous actions for PySpark☆47Updated 3 years ago
- Ansible role to install Apache Airflow☆84Updated 5 months ago
- A DBAPI and SQLAlchemy dialect for Elasticsearch☆117Updated last year
- A client for the Confluent Schema Registry API implemented in Python☆53Updated 2 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- A Python binding for ./jq☆198Updated last year
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- Pure Python wrapper for the Hadoop WebHDFS Rest API☆52Updated 5 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆260Updated 2 years ago
- Amazon S3 filesystem for PyFilesystem2☆156Updated last year
- Python multiprocessing objects and patterns☆13Updated 5 years ago
- Serializes data into a JSON format using AVRO schema.☆137Updated 3 years ago
- Apache Avro <-> pandas DataFrame☆138Updated last year
- systemd wrapper on Cython☆103Updated last month
- Gather system information about airflow processes☆18Updated 5 years ago
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces☆326Updated 4 years ago
- Command line (CLI) tool to inspect Apache Parquet files on the go☆194Updated last year