internetarchive / snakebite-py3
Pure Python HDFS client: Python 3.x version
☆22 Updated this week
Alternatives and similar repositories for snakebite-py3:
Users who are interested in snakebite-py3 are comparing it to the libraries listed below.
- Apache (Py)Spark type annotations (stub files). ☆115 Updated 2 years ago
- ☕⛵ WIP PySpark dependency management ☆22 Updated 6 years ago
- Python Driver for Apache Drill. ☆59 Updated 2 years ago
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks ☆50 Updated last year
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters. ☆71 Updated 3 years ago
- Deploy dask on YARN clusters ☆69 Updated 6 months ago
- Pure Python wrapper for the Hadoop WebHDFS REST API ☆52 Updated 4 years ago
- A tool and library for easily deploying applications on Apache YARN ☆142 Updated 11 months ago
- Monitor Apache Spark from Jupyter Notebook ☆172 Updated 2 years ago
- Visualize dependencies between Airflow DAGs ☆49 Updated 3 years ago
- lazyimport lets you import Python modules lazily. ☆11 Updated 6 years ago
- transformpy is a Python 2/3 module for doing transforms on "streams" of data ☆29 Updated 7 years ago
- A plugin for Apache Airflow that allows you to manage the users that can log in ☆14 Updated 5 years ago
- Example for an Airflow plugin ☆49 Updated 8 years ago
- API and command line interface for HDFS ☆272 Updated 4 months ago
- REST-like API exposing Airflow data and operations ☆61 Updated 6 years ago
- Asynchronous actions for PySpark ☆47 Updated 3 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python ☆136 Updated 4 years ago
- Apache Drill Dialect for SQLAlchemy ☆54 Updated 7 months ago
- Helpers & syntactic sugar for PySpark. ☆61 Updated last year
- Python bindings for FarmHash and CityHash ☆37 Updated 10 months ago
- SQLAlchemy dialect for Turbodbc ☆23 Updated 8 months ago
- Gather system information about Airflow processes ☆18 Updated 4 years ago
- IP Address dtype and block for pandas ☆104 Updated last year
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆79 Updated this week
- Cachy provides a simple yet effective caching library. ☆42 Updated 2 years ago
- Simple dataclasses configuration management for Python with hocon/json/yaml/properties/env-vars/dict/cli support. ☆82 Updated 3 weeks ago
- Triggering a DAG run multiple times ☆86 Updated 11 months ago
- Apache Avro <-> pandas DataFrame ☆136 Updated 6 months ago
- Python bindings for sqlparser-rs ☆175 Updated last week