yahwang / Awesome-Data-Engineering
(GitBook) A curated list of awesome Data Engineering resources
☆36 Updated this week
Alternatives and similar repositories for Awesome-Data-Engineering
Users that are interested in Awesome-Data-Engineering are comparing it to the libraries listed below
- ☆83 Updated 2 years ago
- Coding exercises and research on the component technologies of big data pipelines ☆41 Updated 5 years ago
- ☆28 Updated 2 years ago
- A summary of data engineering technologies ☆18 Updated last year
- ☆110 Updated 2 years ago
- Stream smartphone data with FastAPI, Kafka, QuestDB, and Docker. ☆26 Updated last year
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority. ☆31 Updated last year
- DataOps(Data Operation), MLOps(Machine Learning Operation) Contents ☆131 Updated 4 years ago
- Awesome list for datapipeline ☆34 Updated 2 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,… ☆90 Updated 3 years ago
- Data engineering interviews Q&A for data community by data community ☆63 Updated 5 years ago
- data engineer advanced training course ☆9 Updated 2 months ago
- ☆14 Updated 8 months ago
- Everything you need for a DE (data engineer) role ☆202 Updated 2 months ago
- Gitbook Repo for Practical Data Pipeline ☆25 Updated 3 years ago
- Code snippets for Data Engineering Design Patterns book ☆132 Updated 4 months ago
- Kafka Connect connector that reads JSON data from Apache Kafka and sends JSON records to another Kafka topic. ☆51 Updated last year
- Korean translation of the official Spark documentation ☆16 Updated 3 years ago
- Web tool for operating kafka connect https://hub.docker.com/r/officialkakao/kafka-connect-web ☆115 Updated last year
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti… ☆29 Updated 2 years ago
- Collects high-quality references for improving data & analytics governance and offers a place to share ideas. ☆73 Updated 2 years ago
- This project shows how to serve a TF-based image classification model as a web service with TFServing, Docker, and Kubernetes (GKE). ☆125 Updated 2 years ago
- ☆48 Updated 3 years ago
- MLOps and LLMOps using AWS SageMaker ☆32 Updated last year
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for … ☆137 Updated 5 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC) ☆59 Updated last year
- A Snowflake GPT Demo using SqlAlchemy ☆23 Updated 2 years ago
- ☆41 Updated last year
- ☆19 Updated 9 months ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian ☆216 Updated 2 years ago