mmphego / simple-etl
☆16Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for simple-etl
- Sample project to demonstrate data engineering best practices☆166Updated 8 months ago
- Code for dbt tutorial☆143Updated 5 months ago
- This project is for demonstrating knowledge of Data Engineering tools and concepts and also learning in the process☆45Updated last year
- End to end data engineering project☆51Updated 2 years ago
- ☆104Updated 3 months ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆241Updated 4 months ago
- ☆128Updated last year
- Project for "Data pipeline design patterns" blog.☆41Updated 3 months ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆58Updated last year
- Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu…☆66Updated 11 months ago
- ☆113Updated last month
- Simple stream processing pipeline☆92Updated 5 months ago
- ☆21Updated last year
- Code for "Efficient Data Processing in Spark" Course☆247Updated last month
- End-to-end data platform leveraging the Modern data stack☆41Updated 7 months ago
- Hey this is the repo that has all the queries and data for my video game training series!☆133Updated 2 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆18Updated 2 months ago
- Course notes for the Astronomer Certification DAG Authoring for Apache Airflow☆51Updated 8 months ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆50Updated 3 months ago
- In this repository we will store all materials for workshops, courses, etc.☆35Updated this week
- Data pipeline with dbt, Airflow, Great Expectations☆158Updated 3 years ago
- ☆143Updated 8 months ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆196Updated this week
- I will attempt to create my own spotify wrapped by collecting data from the spotify API, perform transformations and create informative d…☆74Updated last year
- ☆41Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆55Updated 5 months ago
- Data Engineering examples for Airflow, Prefect, and Mage.ai; dbt for BigQuery, Redshift, ClickHouse, PostgreSQL; Spark/PySpark for Batch …☆51Updated last week
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard.☆197Updated last year
- ☆132Updated 2 years ago