PacktPublishing / Bigdata-on-Kubernetes
Bigdata on Kubernetes, Published by Packt
☆27Updated 3 months ago
Alternatives and similar repositories for Bigdata-on-Kubernetes:
Users who are interested in Bigdata-on-Kubernetes are comparing it to the repositories listed below
- Data Engineering with Scala, published by Packt☆22Updated 11 months ago
- Data Engineering with Databricks Cookbook, published by Packt☆62Updated 7 months ago
- Architecting Google Cloud Solutions, published by Packt☆13Updated 2 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt☆43Updated 2 years ago
- Data Engineering with AWS, 2nd edition - Published by Packt☆126Updated last year
- Data Engineering with Google Cloud Platform - Second Edition, published by Packt☆29Updated 8 months ago
- Apache Airflow Best Practices, published by Packt☆30Updated 2 months ago
- ☆61Updated 3 weeks ago
- Duke MIDS: Data Engineering and DataOps Course☆64Updated 2 weeks ago
- Databricks ML in Action, Published by Packt☆27Updated 8 months ago
- Found a data engineering challenge or participated in a selection process? Share with us!☆63Updated 2 years ago
- Data Engineering with Spark and Delta Lake☆94Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆39Updated last year
- ☆40Updated 6 months ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka☆31Updated 8 months ago
- Build & Learn Data Engineering, Machine Learning over Kubernetes. No Shortcut approach.☆58Updated 2 years ago
- Demo Codes will be shared here☆43Updated 2 months ago
- Data Engineering with Google Cloud Platform, published by Packt☆113Updated last year
- Building ETL Pipelines with Python☆118Updated 6 months ago
- Realtime Data Engineering Project☆27Updated 2 weeks ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆21Updated 2 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWS☆173Updated 3 years ago
- This project demonstrates how to use Apache Airflow to submit jobs to an Apache Spark cluster in different programming languages using Python…☆35Updated 10 months ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆131Updated last year
- Apache Spark 3 for Data Engineering and Analytics with Python, by Packt Publishing☆22Updated last year
- Data engineering with dbt, published by Packt☆66Updated 10 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆30Updated 10 months ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 5 months ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt☆21Updated last year
- Code Repository for AWS Certified Big Data Specialty 2019 - In Depth and Hands On!, published by Packt☆39Updated last year