ynqa / pandavro
Apache Avro <-> pandas DataFrame
☆136Updated 5 months ago
Alternatives and similar repositories for pandavro:
Users that are interested in pandavro are comparing it to the libraries listed below
- pytest plugin to run the tests with support of pyspark☆84Updated 10 months ago
- Asynchronous actions for PySpark☆47Updated 3 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆195Updated last month
- Pylint plugin for static code analysis on Airflow code☆91Updated 4 years ago
- Apache (Py)Spark type annotations (stub files).☆115Updated 2 years ago
- Collection of transforms for the Apache beam python SDK.☆89Updated last year
- ☆196Updated last year
- A tool and library for easily deploying applications on Apache YARN☆142Updated 10 months ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆261Updated last year
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆71Updated 3 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated last year
- Airflow Unit Tests and Integration Tests☆256Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated 9 months ago
- Deploy dask on YARN clusters☆69Updated 5 months ago
- BigQuery backend for Ibis☆19Updated last year
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆137Updated last week
- Read Delta tables without any Spark☆47Updated 10 months ago
- Astronomer Core Docker Images☆106Updated 7 months ago
- Docker images for dask☆233Updated last week
- Airflow Backfill UI based plugin for existing / new Airflow environment☆66Updated 4 years ago
- ☆127Updated 4 years ago
- Great Expectations Airflow operator☆160Updated 2 months ago
- Helpers & syntactic sugar for PySpark.☆61Updated last year
- Amazon Redshift SQLAlchemy Dialect☆220Updated 6 months ago
- A command-line tool for managing permissions and dependencies for BigQuery authorized views☆89Updated 2 years ago
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆49Updated 11 months ago
- Pythonic file-system interface for Google Cloud Storage☆352Updated last week