ssp-data / data-engineering-devops
Full stack data engineering tools and infrastructure set-up
☆57 Updated 4 years ago
Alternatives and similar repositories for data-engineering-devops
Users who are interested in data-engineering-devops compare it to the repositories listed below.
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆91 Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB ☆61 Updated 8 months ago
- New generation opensource data stack ☆76 Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆60 Updated last year
- Template for Data Engineering and Data Pipeline projects ☆116 Updated 3 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆28 Updated 3 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆51 Updated 2 years ago
- ☆79 Updated last week
- Code for dbt tutorial ☆167 Updated 4 months ago
- Utility functions for dbt projects running on Spark ☆34 Updated last month
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 Updated 3 years ago
- Data engineering with dbt, published by Packt ☆89 Updated 5 months ago
- ☆21 Updated last year
- Delta Lake Documentation ☆53 Updated last year
- Cloned by the `dbt init` task ☆62 Updated last year
- Code for "Advanced data transformations in SQL" free live workshop ☆89 Updated 9 months ago
- csv and flat-file sniffer built in Rust. ☆45 Updated 2 years ago
- A curated list of dagster code snippets for data engineers ☆56 Updated last year
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆225 Updated 9 months ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset ☆55 Updated 3 months ago
- ☆40 Updated 10 months ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset ☆46 Updated last month
- A repository of sample code to show data quality checking best practices using Airflow. ☆78 Updated 2 years ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team … ☆131 Updated last week
- Fake Pandas / PySpark DataFrame creator ☆48 Updated last year
- Weekly Data Engineering Newsletter ☆96 Updated last year
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit… ☆69 Updated 3 weeks ago
- Some example projects for Data Engineers to build, end-to-end. ☆37 Updated 2 years ago
- Example repo to create end to end tests for data pipeline. ☆25 Updated last year
- Execution of DBT models using Apache Airflow through Docker Compose ☆126 Updated 3 years ago