PacktPublishing / Bigdata-on-KubernetesLinks

Bigdata on Kubernetes, Published by Packt

☆35

Alternatives and similar repositories for Bigdata-on-Kubernetes

Users that are interested in Bigdata-on-Kubernetes are comparing it to the libraries listed below

Sorting:

PacktPublishing / Apache-Airflow-Best-Practices
Apache Airflow Best Practices, published by Packt
☆44Updated 9 months ago
josephmachado / docker_for_data_engineers
Code for blog at: https://www.startdataengineering.com/post/docker-for-de/
☆38Updated last year
bartosz25 / data-engineering-design-patterns-book
Code snippets for Data Engineering Design Patterns book
☆142Updated 4 months ago
PacktPublishing / Data-engineering-with-dbt
Data engineering with dbt, published by Packt
☆85Updated last year
PacktPublishing / Data-Engineering-with-Databricks-Cookbook
Data Engineering with Databricks Cookbook, published by Packt
☆94Updated last year
PacktPublishing / Data-Engineering-with-Apache-Spark-Delta-Lake-and-Lakehouse
Data Engineering with Spark and Delta Lake
☆102Updated 2 years ago
minhadona / data_engineer_interview_challenges
Found a data engineering challenge or participated in a selection process ? Share with us!
☆65Updated 2 years ago
sarthak-sarbahi / data-analytics-minio-spark
☆16Updated last year
noahgift / data-engineering-and-dataops
Duke MIDS: Data Engineering and DataOps Course
☆67Updated 6 months ago
PacktPublishing / Data-Engineering-with-Google-Cloud-Platform
Data Engineering with Google Cloud Platform, published by Packt
☆119Updated last year
PacktPublishing / Building-ETL-Pipelines-with-Python
Building ETL Pipelines with Python
☆150Updated last year
PacktPublishing / Data-Engineering-with-Google-Cloud-Platform-Second-Edition
Data Engineering with Google Cloud Platform - Second Edition, published by Packt
☆41Updated last year
dogukannulu / airflow_kafka_cassandra_mongodb
Produce Kafka messages, consume them and upload into Cassandra, MongoDB.
☆42Updated last year
itversity / data-engineering-spark
☆88Updated 2 years ago
derar-alhussein / Databricks-Certified-Data-Engineer-Professional
The resources of the preparation course for Databricks Data Engineer Professional certification exam
☆127Updated last month
dogukannulu / kafka_spark_structured_streaming
Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra
☆141Updated 2 years ago
PacktPublishing / Data-Engineering-with-Scala-and-Spark
Data Engineering with Scala, published by Packt
☆25Updated last year
simardeep1792 / Data-Engineering-Streaming-Project
☆41Updated last year
alonsomedo / os-data-stack
Building a Data Pipeline with an Open Source Stack
☆55Updated last month
PacktPublishing / Data-Engineering-with-AWS-2nd-edition
Data Engineering with AWS, 2nd edition - Published by Packt
☆150Updated last year
abeltavares / real-time-data-pipeline
📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.
☆47Updated 6 months ago
josephmachado / adv_data_transformation_in_sql
Code for "Advanced data transformations in SQL" free live workshop
☆83Updated 3 months ago
airscholar / SparkingFlow
This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python…
☆44Updated last year
ssp-data / data-engineering-devops
Full stack data engineering tools and infrastructure set-up
☆55Updated 4 years ago
TJaniF / airflow-elt-blueprint
A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.
☆74Updated last year
PacktPublishing / Essential-PySpark-for-Scalable-Data-Analytics
Essential PySpark for Scalable Data Analytics, published by Packt
☆45Updated 2 years ago
benniehaelen / delta-lake-up-and-running
Companion repository for the book 'Delta Lake Up and Running'
☆47Updated 4 months ago
arezamoosavi / AcidOnSpark-ETL
Delta-Lake, ETL, Spark, Airflow
☆47Updated 2 years ago
Armaan1Gohil / dataengineering-tech-stack
Local Environment to Practice Data Engineering
☆143Updated 7 months ago
dogukannulu / streaming_data_processing
Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO
☆63Updated 2 years ago