gtoonstra / databook
A facebook for data
☆26 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for databook
- ETLy is an add-on dashboard service on top of Apache Airflow. ☆69 · Updated last year
- [ARCHIVED] The Presto adapter plugin for dbt Core ☆33 · Updated 11 months ago
- The sane way of building a data layer in Airflow ☆24 · Updated 4 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator ☆75 · Updated 5 years ago
- REST-like API exposing Airflow data and operations ☆61 · Updated 5 years ago
- Data validation library for PySpark 3.0.0 ☆34 · Updated 2 years ago
- Chef cookbook for the Airflow workflow management platform. ☆68 · Updated 5 years ago
- Airflow plugin to transfer arbitrary files between operators ☆78 · Updated 6 years ago
- Metadata service library for Amundsen ☆83 · Updated last year
- Example orchestration pipeline for Fivetran + dbt managed by Airflow ☆21 · Updated 3 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connects to many data sources (cloud services, databases, ...) and manages dat… ☆16 · Updated 3 years ago
- Profiles the data, validates the schema and runs data quality checks and produces a report ☆20 · Updated 5 years ago
- Lightweight configuration and access to multiple databases in a single project ☆38 · Updated 11 months ago
- Utilities for creating ETL pipelines with mara ☆36 · Updated 2 years ago
- Composable filesystem hooks and operators for Apache Airflow. ☆17 · Updated 3 years ago
- Data ingestion library for Amundsen to build graph and search index ☆206 · Updated 8 months ago
- DBT Cloud Plugin for Airflow ☆38 · Updated 6 months ago
- Data models for snowplow analytics. ☆126 · Updated 10 months ago
- SQL data model for working with Snowplow web data. Supports Redshift and Looker. Snowflake and BigQuery coming soon ☆61 · Updated 3 years ago
- A Getting Started Guide for developing and using Airflow Plugins ☆94 · Updated 6 years ago
- Utility functions for dbt projects running on Spark ☆31 · Updated last year
- 🚚 ETL for Spark and Airflow ☆24 · Updated 6 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 · Updated 7 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al. ☆20 · Updated 2 weeks ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html ☆60 · Updated last year
- Visualize dependencies between Airflow DAGs ☆49 · Updated 3 years ago