yahwang / Awesome-Data-Engineering
(GitBook) A curated list of awesome Data Engineering resources
☆35 Updated 3 weeks ago
Alternatives and similar repositories for Awesome-Data-Engineering:
Users who are interested in Awesome-Data-Engineering are comparing it to the repositories listed below.
- ☆82 Updated 2 years ago
- Awesome list for data pipeline ☆34 Updated 2 years ago
- ☆28 Updated 2 years ago
- ☆49 Updated 3 years ago
- Coding practice and research on the component technologies of big data pipelines ☆41 Updated 5 years ago
- DataOps (Data Operation), MLOps (Machine Learning Operation) Contents ☆131 Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up ☆51 Updated 4 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC) ☆56 Updated last year
- Gitbook Repo for Practical Data Pipeline ☆25 Updated 3 years ago
- ☆106 Updated last year
- MLOps and LLMOps using AWS SageMaker ☆32 Updated last year
- Summary of data engineer technologies ☆18 Updated last year
- Repository for Practical Data Pipeline Code ☆11 Updated 3 years ago
- Data engineering interviews Q&A for data community by data community ☆63 Updated 4 years ago
- Code snippets for Data Engineering Design Patterns book ☆80 Updated last month
- This project shows how to serve a TF-based image classification model as a web service with TFServing, Docker, and Kubernetes (GKE). ☆122 Updated 2 years ago
- Korean translation of the official Spark documentation ☆16 Updated 3 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup ☆87 Updated 4 years ago
- This is a repo with links to everything you'd ever want to learn about data engineering ☆10 Updated 4 months ago
- Grafana Plugin for Snowflake ☆44 Updated 3 months ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,… ☆90 Updated 3 years ago
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti… ☆29 Updated last year
- ☆40 Updated 9 months ago
- ☆13 Updated 5 months ago
- Hands-on practice building an Elastic Stack data pipeline ☆19 Updated 3 years ago
- AB Testing related articles ☆37 Updated last year
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority. ☆30 Updated last year
- Weekly Data Engineering Newsletter ☆95 Updated 9 months ago
- A place to collect high-quality references for improving data & analytics governance and to share ideas. ☆74 Updated 2 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆54 Updated last year