yahwang / Awesome-Data-Engineering
π(GitBook) A curated list of awesome Data Engineering resources
β34Updated last month
Alternatives and similar repositories for Awesome-Data-Engineering:
Users that are interested in Awesome-Data-Engineering are comparing it to the libraries listed below
- β81Updated last year
- λΉ λ°μ΄ν° pipeline κ΅¬μ± μμ κΈ°μ λ€μ κ΄ν coding μ€μ΅ λ° μ°κ΅¬β41Updated 5 years ago
- β28Updated 2 years ago
- This project shows how to serve an TF based image classification model as a web service with TFServing, Docker, and Kubernetes(GKE).β120Updated 2 years ago
- DataOps(Data Operation), MLOps(Machine Learning Operation) Contentsβ130Updated 3 years ago
- Code snippets for Data Engineering Design Patterns bookβ69Updated 2 weeks ago
- β102Updated last year
- β13Updated 3 months ago
- β49Updated 3 years ago
- λ°μ΄ν° μμ§λμ΄ κΈ°μ μ 리β18Updated last year
- Spark 곡μ λ¬Έμ νκ΅μ΄ν λ²μβ16Updated 3 years ago
- Gitbook Repo for Practical Data Pipelineβ25Updated 3 years ago
- Design/Implement stream/batch architecture on NYC taxi data | #DEβ26Updated 3 years ago
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority.β29Updated last year
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)β50Updated last year
- Data engineering interviews Q&A for data community by data communityβ63Updated 4 years ago
- λ°μ΄ν° & λΆμ κ±°λ²λμ€ μ κ³ λ₯Ό μν μμ§μ λ νΌλ°μ€λ€μ μμ§νκ³ μκ°μ λλ μ μμ΅λλ€.β75Updated last year
- Weekly Data Engineering Newsletterβ94Updated 7 months ago
- Awesome list for datapipelineβ32Updated 2 years ago
- νλΉλ―Έλμ΄μμ μΆκ°ν γνμ΄μ¬κ³Ό λμ€ν¬λ₯Ό νμ©ν κ³ μ±λ₯ λ°μ΄ν° λΆμγ μ μμ€μ½λ μ μ₯μβ12Updated 4 years ago
- AB Testing π related articlesβ37Updated last year
- AWS SageMakerλ₯Ό μ΄μ©ν MLOpsμ LLMOpsβ33Updated last year
- Tutorial for Scala on Spark onlyβ12Updated 6 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatioβ¦β53Updated last year
- (project & tutorial) dag pipeline tests + ci/cd setupβ86Updated 4 years ago
- κ΅¬κΈ λΉ μΏΌλ¦¬ μλ²½ κ°μ΄λβ48Updated 4 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).β15Updated 3 years ago
- β41Updated 7 months ago
- β33Updated 2 years ago
- Repository for Practical Data Pipeline Codeβ11Updated 3 years ago