IhorLuk / medium-materials
☆14Updated 5 months ago
Related projects
Alternatives and complementary repositories for medium-materials
- Writes a CSV file to Postgres, reads the table and modifies it, then writes more tables to Postgres with Airflow.☆35Updated last year
- Essential PySpark for Scalable Data Analytics, published by Packt☆43Updated last year
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, dbt, BigQuery, etc.☆11Updated 2 years ago
- Retail data pipeline using Airflow, dbt, Soda, GCP (GCS and BigQuery) and Metabase☆32Updated 4 months ago
- Portfolio of projects and studies conducted in data engineering.☆33Updated 6 months ago
- ☆86Updated 2 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆21Updated 2 years ago
- AWS ETL Pipeline☆20Updated 6 months ago
- Building ETL Pipelines with Python☆106Updated 4 months ago
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like…☆83Updated last year
- Data science tips and tricks☆16Updated 6 months ago
- ☆38Updated 4 months ago
- An exercise running Kafka, Kafka Connect, PostgreSQL, Superset and AWS S3☆21Updated 3 years ago
- Data science☆12Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆37Updated last year
- Script for ingesting data from Mercado Bitcoin☆11Updated last year
- End-to-end data engineering project☆20Updated 9 months ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆14Updated 5 years ago
- Repository for the book Simplifying Machine Learning with PyCaret.☆60Updated last year
- Data Engineering with Scala, published by Packt☆19Updated 9 months ago
- Central repository for the second workshop☆15Updated last year
- ☆13Updated 10 months ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆32Updated 11 months ago
- A Series of Notebooks on how to start with Kafka and Python☆153Updated last year
- Streamlit app for visualizing Fidelity account export CSV data☆44Updated 2 years ago
- A Postgres data warehouse for processing synthetic data using IaC principles☆16Updated last year
- ☆14Updated 5 months ago
- Data Engineering with Google Cloud Platform, published by Packt☆109Updated last year
- Apache Airflow Best Practices, published by Packt☆20Updated 2 weeks ago
- ☆15Updated 9 months ago