PacktPublishing / Bigdata-on-Kubernetes
Bigdata on Kubernetes, Published by Packt
☆14 Updated last month
Related projects:
- Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta Lake, Postgres + Debezium CDC, MySQL,… ☆25 Updated 5 months ago
- ☆23 Updated 2 years ago
- ☆16 Updated last month
- Data Engineering with Databricks Cookbook, published by Packt ☆26 Updated 3 months ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka ☆28 Updated 4 months ago
- End-to-end data engineering project ☆15 Updated 7 months ago
- Data engineering with dbt, published by Packt ☆55 Updated 6 months ago
- Data Engineering with Scala, published by Packt ☆16 Updated 7 months ago
- ☆22 Updated last year
- Data Engineering with AWS, 2nd edition - Published by Packt ☆95 Updated 10 months ago
- Data Engineering with Apache Spark ☆43 Updated 3 years ago
- Databricks ML in Action, Published by Packt ☆19 Updated 4 months ago
- Building ETL Pipelines with Python ☆97 Updated 2 months ago
- Code Repository for AWS Certified Big Data Specialty 2019 - In Depth and Hands On!, published by Packt ☆38 Updated 10 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a… ☆20 Updated 6 months ago
- This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/dat… ☆17 Updated 2 years ago
- ☆15 Updated 5 months ago
- Practical Machine Learning on Databricks, published by Packt ☆15 Updated 10 months ago
- An exercise running Kafka, Kafka Connect, PostgreSQL, Superset and AWS S3 ☆21 Updated 3 years ago
- Data Engineering with Spark and Delta Lake ☆86 Updated last year
- Sample repo for startdataengineering DE 101 free course ☆24 Updated 2 months ago
- Found a data engineering challenge or participated in a selection process? Share with us! ☆61 Updated last year
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos ☆32 Updated 2 weeks ago
- Creating Lambda Functions to Ingest Data from APIs with AWS CDK ☆14 Updated 2 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra ☆125 Updated last year
- Delta Lake Documentation ☆45 Updated 3 months ago
- Serverless ETL and Analytics with AWS Glue, published by Packt ☆45 Updated 11 months ago
- Spark development environment for Kubernetes, spark-submit and Jupyter notebook ☆19 Updated 2 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt ☆43 Updated last year
- ☆36 Updated 2 years ago