Project for "Data pipeline design patterns" blog.
☆50 · Aug 6, 2024 · Updated last year
Alternatives and similar repositories for socialetl
Users who are interested in socialetl are comparing it to the libraries listed below.
- Code to demonstrate data engineering metadata & logging best practices ☆21 · Mar 12, 2024 · Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆13 · May 24, 2024 · Updated last year
- Trying out the Dataframe Polars library with Delta Lake ... feat Python. ☆12 · Jan 29, 2025 · Updated last year
- Code for data quality with greatexpectations blog ☆13 · Jul 30, 2024 · Updated last year
- ☆16 · Apr 26, 2024 · Updated last year
- A custom end-to-end analytics platform for customer churn ☆11 · May 15, 2025 · Updated 9 months ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆40 · Apr 29, 2024 · Updated last year
- Near real time ETL to populate a dashboard. ☆73 · Sep 9, 2025 · Updated 6 months ago
- Step by step instructions to create a production-ready data pipeline ☆58 · Dec 23, 2024 · Updated last year
- Sample project to demonstrate data engineering best practices ☆206 · Feb 24, 2024 · Updated 2 years ago
- Various wrappers and functions over dictionary to add functionalities. ☆13 · Feb 14, 2026 · Updated 3 weeks ago
- ☆21 · Mar 26, 2023 · Updated 2 years ago
- Code for dbt tutorial ☆171 · Sep 9, 2025 · Updated 6 months ago
- Simple ETL pipeline using Python ☆29 · May 22, 2023 · Updated 2 years ago
- Backend: No more asking where my money goes. ☆10 · Jan 4, 2023 · Updated 3 years ago
- ☆16 · Feb 20, 2026 · Updated 2 weeks ago
- Daily updated fake data for DBT learning and projects ☆35 · Jan 7, 2024 · Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆101 · Jun 7, 2024 · Updated last year
- Code for "Advanced data transformations in SQL" free live workshop ☆91 · May 5, 2025 · Updated 10 months ago
- ☆13 · Nov 12, 2022 · Updated 3 years ago
- C++ Library Management System Applying OOP Concepts ☆11 · Oct 5, 2020 · Updated 5 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with … ☆15 · Dec 27, 2023 · Updated 2 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou… ☆10 · Jul 31, 2023 · Updated 2 years ago
- ☆12 · May 25, 2017 · Updated 8 years ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic… ☆12 · May 9, 2024 · Updated last year
- Project on belief embedding ☆20 · Jun 4, 2025 · Updated 9 months ago
- Python-based reports for Postgres including a data dictionary generator ☆16 · Nov 13, 2015 · Updated 10 years ago
- Go library for efficient skyline queries ☆18 · Aug 17, 2025 · Updated 6 months ago
- The project focuses on the drowsiness of IT employees, drivers, pilots, crane operators, student etc. These people need a system which ca… ☆14 · Sep 13, 2018 · Updated 7 years ago
- Webhooks with live charting ☆11 · Jun 3, 2021 · Updated 4 years ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase ☆11 · Jul 13, 2023 · Updated 2 years ago
- ☆11 · Jan 15, 2019 · Updated 7 years ago
- Data Observability for Data Engineering, published by Packt Publishing ☆11 · Jan 24, 2025 · Updated last year
- Selenium Grid in ECS using Fargate Spot Containers ☆14 · Feb 1, 2023 · Updated 3 years ago
- Fastapi boilerplate for API with user management and authentication using Keycloak ☆10 · Jun 25, 2021 · Updated 4 years ago
- Python script for crawling ResearchGate.net papers.✨⭐️📎 ☆11 · Feb 4, 2022 · Updated 4 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake ☆11 · Dec 6, 2025 · Updated 3 months ago
- Solved data engineering exercises using Pyspark ☆15 · Aug 2, 2021 · Updated 4 years ago
- ☆24 · Oct 15, 2025 · Updated 4 months ago