Local Environment to Practice Data Engineering
β144Dec 30, 2024Updated last year
Alternatives and similar repositories for dataengineering-tech-stack
Users that are interested in dataengineering-tech-stack are comparing it to the libraries listed below
Sorting:
- universal-datalakehouse-postgres-ingestion-deltastreamerβ11Apr 7, 2024Updated last year
- π‘ Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.β59Jan 18, 2025Updated last year
- β16Aug 23, 2022Updated 3 years ago
- This repo via a real world use case, shows how to launch dbt models from a DAG in Apache Airflow.β13Apr 24, 2025Updated 10 months ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.β15Jun 19, 2022Updated 3 years ago
- Code snippets for Data Engineering Design Patterns bookβ349Feb 16, 2026Updated last week
- Realtime Data Engineering Projectβ30Jan 12, 2025Updated last year
- Deploy a complete data stack in just a couple of minutes.β15Mar 6, 2024Updated last year
- All the material from the Udemy course "Beyond Jupyter Notebooks"β18Mar 12, 2019Updated 6 years ago
- The Christmas Project is a festive-themed data engineering initiative designed to integrate and analyze diverse datasets, creating a compβ¦β19Jan 11, 2025Updated last year
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,β¦β47Oct 14, 2024Updated last year
- FastAPI ASGI with Django ORM and adminβ15May 15, 2022Updated 3 years ago
- Nyc_Taxi_Data_Pipeline - DE Projectβ139Oct 21, 2024Updated last year
- β21May 13, 2025Updated 9 months ago
- β22Aug 31, 2024Updated last year
- β22Apr 2, 2025Updated 10 months ago
- A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation anβ¦β23Nov 21, 2023Updated 2 years ago
- In this repository we store all materials for dlt workshops, courses, etc.β253Dec 11, 2025Updated 2 months ago
- β146Jan 31, 2023Updated 3 years ago
- β180Aug 24, 2025Updated 6 months ago
- Cloud Functions streaming insert to BigQuery (with Cloud Pub/Sub trigger). In this example, the function will make a REST API call to getβ¦β28Aug 28, 2023Updated 2 years ago
- Project with Airflow + Spark + MinIO + Postgres + Python3.8β28Sep 9, 2022Updated 3 years ago
- For the Coursera specialization https://www.coursera.org/specializations/gcp-data-machine-learningβ99Nov 22, 2017Updated 8 years ago
- β19Feb 25, 2022Updated 4 years ago
- Feature demos, integration guides & hands-on labs/projects using Kpow, Flex, Kafka, Flink, Iceberg & moreβ50Updated this week
- Data Engineering Project with Hadoop HDFS and Kafkaβ122Nov 4, 2023Updated 2 years ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testinβ¦β75Sep 2, 2023Updated 2 years ago
- Data Engineering Practice Problemsβ2,547Jan 8, 2025Updated last year
- datacamp Data Engineer with Python course. 73 hours/ 19 Courses /2 Skill Assessmentsβ139Nov 29, 2022Updated 3 years ago
- Upload of all my presentations which I've been doing in the pastβ10Feb 5, 2026Updated 3 weeks ago
- ML Model to Predict that Rating of a User based on other factorsβ17May 22, 2025Updated 9 months ago
- β10Aug 6, 2024Updated last year
- This is a demo project to compare two web scrapping frameworks, Playwright and Selenium and using the new Pipelining tool Dagsterβ15Sep 9, 2021Updated 4 years ago
- This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The ficticiβ¦β14Sep 30, 2024Updated last year
- En este repositorio habra una gran variedad de ejercicios tecnicos realizados en pythonβ12Feb 16, 2025Updated last year
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize aβ¦β36Dec 15, 2025Updated 2 months ago
- Code for DE101 book at https://de101.startdataengineering.com/β87Updated this week
- Code snippets used in demos recorded for the blog.β38Feb 17, 2026Updated last week
- β10Jul 21, 2022Updated 3 years ago