deordie / deordie-digest
Data Engineering Digest
☆27Updated 2 months ago
Related projects: ⓘ
- Command-line interface to quickly generate fake CSV and JSON data☆72Updated 2 months ago
- Airflow declarative DAGs via YAML☆131Updated last year
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆56Updated 8 months ago
- Data validation library for PySpark 3.0.0☆34Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆40Updated 7 months ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆9Updated 7 months ago
- Library to convert DBT manifest metadata to Airflow tasks☆45Updated 6 months ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- A Table format agnostic data sharing framework☆36Updated 7 months ago
- ☆16Updated last year
- Weekly Data Engineering Newsletter☆93Updated 2 months ago
- Yet Another (Spark) ETL Framework☆18Updated 10 months ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated 9 months ago
- Official dbt adapter for Vertica☆25Updated this week
- dbt module for myBI connect☆11Updated last year
- ☆13Updated 7 months ago
- ☆37Updated 6 months ago
- The go to demo for public and private dbt Learn☆70Updated 2 weeks ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆141Updated 2 weeks ago
- Adaptation postgres adapter for Greenplum☆32Updated 6 months ago
- Enforce Best Practices for all your Airflow DAGs. ⭐☆86Updated this week
- Code snippets for Data Engineering Design Patterns book☆27Updated this week
- ITSumma Spark Greenplum Connector☆34Updated 5 months ago
- Data Tools Subjective List☆80Updated last year
- Realtime monitoring of running, queued and blocked queries in Snowflake☆21Updated 7 months ago
- A DataOps framework for building a lakehouse.☆23Updated this week
- Airflow training for the crunch conf☆105Updated 5 years ago
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.☆35Updated last month
- Flowchart for debugging Spark applications☆100Updated last week
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆52Updated 5 months ago