Factual / parquet-rewriter
A library to mutate parquet files
☆19 Updated last year
Alternatives and similar repositories for parquet-rewriter:
Users who are interested in parquet-rewriter are comparing it to the libraries listed below.
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- Example for an airflow plugin ☆49 Updated 8 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 Updated 8 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 Updated last year
- ☕⛵WIP PySpark dependency management ☆22 Updated 6 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different. ☆28 Updated 7 years ago
- REST-like API exposing Airflow data and operations ☆61 Updated 6 years ago
- Airflow plugin to transfer arbitrary files between operators ☆78 Updated 6 years ago
- Functional Airflow DAG definitions. ☆38 Updated 7 years ago
- 💻 CLI for reporting events to Faros platform ☆14 Updated 3 months ago
- DBT Cloud Plugin for Airflow ☆38 Updated 8 months ago
- Ibis analytics, with Ibis (and more!) ☆20 Updated 4 months ago
- Visualize dependencies between Airflow DAGs ☆49 Updated 3 years ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆43 Updated last year
- A plugin for Apache Airflow that allows you to manage the users that can login ☆14 Updated 5 years ago
- A facebook for data ☆26 Updated 5 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆195 Updated last month
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark ☆58 Updated last year
- Apiary provides modules which can be combined to create a federated cloud data lake ☆36 Updated 9 months ago
- Python Driver for Apache Drill. ☆58 Updated last year
- ☆79 Updated last year
- A Python wrapper over the GraphGen system ☆37 Updated 7 years ago
- Data Access Layer ☆28 Updated 2 years ago
- Read Delta tables without any Spark ☆47 Updated 10 months ago
- ☆18 Updated 9 months ago
- Parse dbt artifacts and search dbt models with Algolia ☆52 Updated 3 years ago
- Zero configuration Airflow plugin that lets you manage your DAG files. ☆38 Updated 3 years ago
- An integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes ☆81 Updated 4 years ago
- Demos of Materialize, the operational data warehouse. ☆51 Updated 4 months ago