IhorLuk / medium-materialsLinks
☆14Updated 3 months ago
Alternatives and similar repositories for medium-materials
Users that are interested in medium-materials are comparing it to the libraries listed below
Sorting:
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆37Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow☆48Updated 2 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Updated 3 years ago
- AWS ETL Pipleine☆30Updated last year
- End to end data engineering project☆57Updated 2 years ago
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc☆14Updated 3 years ago
- Code for my "Efficient Data Processing in SQL" book.☆58Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆85Updated last year
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated last year
- Code snippets for Data Engineering Design Patterns book☆151Updated 5 months ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Updated 2 years ago
- Python data repo, jupyter notebook, python scripts and data.☆526Updated 8 months ago
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker.☆21Updated last year
- Sample project to demonstrate data engineering best practices☆197Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆42Updated last year
- ☆88Updated 2 years ago
- End-to-end data platform leveraging the Modern data stack☆51Updated last year
- Project for "Data pipeline design patterns" blog.☆45Updated last year
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆39Updated last year
- Data Engineering with Google Cloud Platform, published by Packt☆118Updated last year
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆142Updated 2 years ago
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like…☆131Updated last year
- end-to-end data engineering project☆21Updated last year
- ☆40Updated 2 years ago
- Simple stream processing pipeline☆108Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆84Updated 4 months ago
- Code for dbt tutorial☆161Updated 3 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆102Updated 5 months ago
- A tutorial for the Great Expectations library.☆71Updated 4 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆25Updated 2 years ago