ynqa / pandavro
Apache Avro <-> pandas DataFrame
☆135Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for pandavro
- pytest plugin to run the tests with support of pyspark☆85Updated 8 months ago
- Pylint plugin for static code analysis on Airflow code☆90Updated 4 years ago
- Deploy dask on YARN clusters☆69Updated 3 months ago
- Apache (Py)Spark type annotations (stub files).☆115Updated 2 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆193Updated 5 months ago
- Pandas ExtensionDType/Array backed by Apache Arrow☆229Updated last year
- A tool and library for easily deploying applications on Apache YARN☆142Updated 8 months ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 2 years ago
- ☆196Updated last year
- Airflow Backfill UI based plugin for existing / new Airflow environment☆66Updated 3 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆262Updated last year
- Command line (CLI) tool to inspect Apache Parquet files on the go☆175Updated last year
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆262Updated 2 months ago
- Docker images for dask☆232Updated last week
- Read Delta tables without any Spark☆47Updated 8 months ago
- Visualize dependencies between Airflow DAGs☆49Updated 3 years ago
- triggering a DAG run multiple times☆85Updated 8 months ago
- Example unit tests for Apache Spark Python scripts using the py.test framework☆85Updated 8 years ago
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 3 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆134Updated last month
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆104Updated this week
- Astronomer Core Docker Images☆106Updated 5 months ago
- BigQuery backend for Ibis☆19Updated last year
- A consistent table management library in python☆161Updated last year
- Airflow support for Marquez☆32Updated 3 years ago
- python implementation of the parquet columnar file format.☆341Updated 3 years ago
- Create HTML profiling reports from Apache Spark DataFrames☆195Updated 4 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated 11 months ago
- A web frontend for scheduling Jupyter notebook reports☆251Updated 2 years ago