internetarchive / snakebite-py3
Pure python HDFS client: python3.x version
☆22Updated last year
Alternatives and similar repositories for snakebite-py3:
Users that are interested in snakebite-py3 are comparing it to the libraries listed below
- SQLAlchemy dialect for Turbodbc☆23Updated 7 months ago
- Apache (Py)Spark type annotations (stub files).☆115Updated 2 years ago
- lazyimport lets you import python modules lazily.☆11Updated 6 years ago
- Concurrent appendable key-value storage☆105Updated 6 months ago
- Monitor Apache Spark from Jupyter Notebook☆172Updated 2 years ago
- Python Driver for Apache Drill.☆58Updated last year
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆49Updated 11 months ago
- Optional extensions for petl based on third party libraries.☆44Updated 9 years ago
- ♃ Debian packaging of JupyterHub, a multi-user server for Jupyter notebooks☆29Updated 2 years ago
- Amazon S3 filesystem for PyFilesystem2☆154Updated 6 months ago
- API and command line interface for HDFS☆273Updated 3 months ago
- Example for an airflow plugin☆49Updated 8 years ago
- Asynchronous actions for PySpark☆47Updated 3 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆136Updated 3 years ago
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook☆92Updated 2 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Deploy dask on YARN clusters☆69Updated 5 months ago
- Prometheus Exporter for Airflow☆160Updated 7 months ago
- A tool and library for easily deploying applications on Apache YARN☆142Updated 10 months ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Apache Avro <-> pandas DataFrame☆136Updated 5 months ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆71Updated 3 years ago
- Jupyter kernel for scala and spark☆187Updated last year
- SQL on dataframes - pandas and dask☆64Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code☆91Updated 4 years ago
- event-triggered plugins for airflow☆21Updated 5 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆268Updated 4 months ago
- Spavro is a (sp)eedier avro library -- Spavro is a fork of the official Apache AVRO python 2 implementation with the goal of greatly impr…☆26Updated last year
- Helper to allow Python Celery tasks to do work in a Spark job.☆27Updated 2 years ago
- Python implementation of postgres meta commands (backslash commands)☆76Updated last month