longbuivan / dotfile
A dotfile repository for setting up a development environment as a data engineer
☆14 Updated 2 months ago
Related projects:
- Nyc_Taxi_Data_Pipeline - DE Project ☆62 Updated last month
- End-to-end data engineering project ☆49 Updated last year
- Simple stream processing pipeline ☆89 Updated 3 months ago
- ☆56 Updated 3 years ago
- Open source stack lakehouse ☆25 Updated 6 months ago
- Code for dbt tutorial ☆138 Updated 3 months ago
- Resources for the preparation course for the Databricks Data Engineer Professional certification exam ☆73 Updated 9 months ago
- Create streaming data, send it to Kafka, transform it with PySpark, and load it into Elasticsearch and MinIO ☆56 Updated last year
- A template repository to create a data project with IaC, CI/CD, data migrations, and testing ☆225 Updated 2 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt ☆100 Updated 2 months ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com ☆83 Updated last year
- Building a Data Pipeline with an Open Source Stack ☆36 Updated 2 months ago
- ☆94 Updated last month
- Code for my "Efficient Data Processing in SQL" book. ☆47 Updated last month
- Sample project to demonstrate data engineering best practices ☆156 Updated 6 months ago
- ☆17 Updated 2 months ago
- Course notes for the Astronomer Certification DAG Authoring for Apache Airflow ☆44 Updated 6 months ago
- Project documentation templates derived from CRISP-DM for use in data engineering projects. ☆38 Updated 3 years ago
- ☆41 Updated 3 years ago
- Spark all the ETL Pipelines ☆29 Updated last year
- Resources for video demonstrations and blog posts related to DataOps on AWS ☆166 Updated 2 years ago
- My first attempt at a rough ETL pipeline; technologies include Spark, GCS, Prefect orchestration, and Terraform ☆14 Updated last year
- velib-v2: an ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools ☆17 Updated last week
- ☆59 Updated 3 months ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka ☆28 Updated 4 months ago
- A repository of sample code to show data quality checking best practices using Airflow. ☆71 Updated last year
- Project for "Data pipeline design patterns" blog. ☆41 Updated last month
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆187 Updated this week
- Delta Lake Documentation ☆45 Updated 3 months ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data ☆37 Updated 9 months ago