IhorLuk / medium-materialsLinks

☆14

Alternatives and similar repositories for medium-materials

Users that are interested in medium-materials are comparing it to the libraries listed below

Sorting:

dogukannulu / csv_extract_airflow_docker
Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.
☆37Updated 2 years ago
g-lorena / aws_etl_pipeline
AWS ETL Pipleine
☆30Updated last year
josephmachado / analytical_dp_with_sql
Code for my "Efficient Data Processing in SQL" book.
☆60Updated last year
josephmachado / python_essentials_for_data_engineers
Code for blog at https://www.startdataengineering.com/post/python-for-de/
☆89Updated last year
pyjaime / docker-airflow-spark
Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks
☆24Updated 3 years ago
dogukannulu / airflow_kafka_cassandra_mongodb
Produce Kafka messages, consume them and upload into Cassandra, MongoDB.
☆42Updated 2 years ago
josephmachado / socialetl
Project for "Data pipeline design patterns" blog.
☆47Updated last year
bartosz25 / data-engineering-design-patterns-book
Code snippets for Data Engineering Design Patterns book
☆271Updated 8 months ago
razevedo1994 / razv-data-engineering
Portfolio of projects and studies conducted in data engineering.
☆34Updated 8 months ago
ankurchavda / data-engineering-zoomcamp
A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc
☆14Updated 3 years ago
PacktPublishing / Apache-Airflow-Best-Practices
Apache Airflow Best Practices, published by Packt
☆51Updated last year
astronautyates / AirflowSnowflakeDBTQuickstart
☆26Updated 2 years ago
dogukannulu / kafka_spark_structured_streaming
Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra
☆143Updated 2 years ago
arezamoosavi / AcidOnSpark-ETL
Delta-Lake, ETL, Spark, Airflow
☆48Updated 3 years ago
josephmachado / online_store
End to end data engineering project
☆57Updated 3 years ago
Dorianteffo / modern-data-platform
End-to-end data platform leveraging the Modern data stack
☆52Updated last year
josephmachado / simple_dbt_project
Code for dbt tutorial
☆165Updated 2 months ago
itversity / data-engineering-spark
☆88Updated 3 years ago
cnstlungu / portable-data-stack-airflow
A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset
☆46Updated last year
alonsomedo / os-data-stack
Building a Data Pipeline with an Open Source Stack
☆54Updated 4 months ago
PacktPublishing / Bigdata-on-Kubernetes
Bigdata on Kubernetes, Published by Packt
☆36Updated last year
jess197 / football_statistics_etl_project
☆13Updated last year
alanceloth / Retail_Data_Pipeline
Retail data pipeline using Airflow, Dbt, Soda, GCP (GCS and BigQuery) and Metabase
☆39Updated last year
dogukannulu / streaming_data_processing
Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO
☆64Updated 2 years ago
yTek01 / docker-spark-airflow
☆40Updated 2 years ago
ssp-data / data-engineering-devops
Full stack data engineering tools and infrastructure set-up
☆57Updated 4 years ago
josephmachado / docker_for_data_engineers
Code for blog at: https://www.startdataengineering.com/post/docker-for-de/
☆40Updated last year
TJaniF / airflow-elt-blueprint
A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.
☆79Updated 2 years ago
Armaan1Gohil / dataengineering-tech-stack
Local Environment to Practice Data Engineering
☆143Updated 10 months ago
luanmorenomaciel / de-apache-spark
Data Engineering com Apache Spark
☆42Updated 4 years ago