Factual / parquet-rewriter
A library to mutate parquet files
☆19Updated last year
Related projects ⓘ
Alternatives and complementary repositories for parquet-rewriter
- The sane way of building a data layer in Airflow☆24Updated 4 years ago
- ETLy is an add-on dashboard service on top of Apache Airflow.☆69Updated last year
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆78Updated last week
- REST-like API exposing Airflow data and operations☆61Updated 5 years ago
- A collection of python utility functions☆12Updated 4 months ago
- Data Catalog for Databases and Data Warehouses☆31Updated 10 months ago
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated 11 months ago
- 🍻 Homebrew formulae for installing dbt on macOS☆12Updated 6 months ago
- Airflow workflow management platform chef cookbook.☆68Updated 5 years ago
- IPython magics to work with DBT☆14Updated 2 years ago
- event-triggered plugins for airflow☆21Updated 4 years ago
- Python SDK for working with Snowplow enriched events in Spark, AWS Lambda et al.☆21Updated last year
- Visualize dependencies between Airflow DAGs☆49Updated 3 years ago
- hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.☆28Updated 6 years ago
- A facebook for data☆26Updated 5 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆75Updated 5 years ago
- Make dbt great again! Enables end user to extend dbt to his/her needs☆13Updated this week
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 3 years ago
- Pylint plugin for static code analysis on Airflow code☆90Updated 4 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆94Updated 6 years ago
- Search service library for Amundsen☆54Updated 6 months ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆60Updated last year
- ☆23Updated last year
- Example for an airflow plugin☆49Updated 8 years ago
- Python - Java/Scala API for the Hopsworks feature store☆53Updated this week
- Data validation library for PySpark 3.0.0☆34Updated 2 years ago
- Helpers & syntactic sugar for PySpark.☆60Updated last year