datastacktv / apache-beam-explainedLinks

Source code for the YouTube video, Apache Beam Explained in 12 Minutes

☆21

Alternatives and similar repositories for apache-beam-explained

Users that are interested in apache-beam-explained are comparing it to the libraries listed below

Sorting:

vincentteyssier / apache-beam-tutorial
☆20Updated 5 years ago
PacktPublishing / Building-Big-Data-Pipelines-with-Apache-Beam
Building Big Data Pipelines with Apache Beam, published by Packt
☆86Updated 2 years ago
griscz / beam-college
Repository for Beam College sessions
☆109Updated 4 years ago
datastacktv / apache-beam-batch-processing
Public source code for the Batch Processing with Apache Beam (Python) online course
☆18Updated 4 years ago
startreedata / pinot-recipes
This repository contains recipes for Apache Pinot.
☆30Updated 5 months ago
idealo / terraform-emr-pyspark
Quickstart PySpark with Anaconda on AWS/EMR using Terraform
☆47Updated 7 months ago
PacktPublishing / Data-Engineering-with-Scala-and-Spark
Data Engineering with Scala, published by Packt
☆25Updated last year
GoogleCloudPlatform / dataflow-sample-applications
☆129Updated last year
PacktPublishing / AWS-Certified-Data-Analytics-Specialty-2023-Hands-on
Code Repository for AWS Certified Big Data Specialty 2019 - In Depth and Hands On!, published by Packt
☆41Updated last year
garystafford / kafka-connect-msk-demo
For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR
☆67Updated 3 years ago
GoogleCloudPlatform / qwiklabs-training-content
markup to create labs for courses from the Google Cloud training catalog.
☆49Updated 3 years ago
astronomer / airflow-example-dags
Sample Airflow DAGs
☆62Updated 2 years ago
axel-sirota / productionalizing-data-pipelines-airflow
Productionalizing Data Pipelines with Apache Airflow
☆113Updated 3 years ago
mneedham / real-time-analytics-book
☆42Updated last year
shravan-kuchkula / udacity-data-eng-proj-1
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…
☆90Updated 3 years ago
velascoluis / dbt-ci-cd-gke
CICD pipeline that deploys a dbt image on a GKE cluster
☆11Updated 4 years ago
GoogleCloudPlatform / serverless-spark-workshop
Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service
☆71Updated last year
stwind / airflow-on-kubernetes
Bare minimal Airflow on Kubernetes (Local, EKS, AKS)
☆53Updated 5 years ago
marclamberti / webinar-airflow-chart
Materials of the Official Helm Chart Webinar
☆27Updated 4 years ago
developer-advocacy-dremio / definitive-guide-to-apache-iceberg
☆89Updated 6 months ago
marclamberti / airflow-materials-aws
Materials for the next course
☆25Updated 2 years ago
paiml / awsbigdata
AWS Big Data Certification
☆25Updated 6 months ago
linuxacademy / content-google-cloud-run-deep-dive
Repository for Google Cloud Run Deep Dive
☆11Updated 5 years ago
bartosz25 / data-engineering-design-patterns-book
Code snippets for Data Engineering Design Patterns book
☆138Updated 4 months ago
apssouza22 / big-data-pipeline-lambda-arch
A full big data pipeline (Lambda Architecture) with Spark, Kafka, HDFS and Cassandra.
☆179Updated last month
PacktPublishing / Data-Engineering-with-Apache-Spark-Delta-Lake-and-Lakehouse
Data Engineering with Spark and Delta Lake
☆102Updated 2 years ago
polyzos / stream-processing-with-apache-flink
☆58Updated last year
asaharland / beam-pipeline-examples
Apache Beam examples for running on Google Cloud Dataflow.
☆30Updated 6 years ago
GoogleCloudPlatform / dlp-dataflow-deidentification
Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP
☆93Updated 11 months ago
PacktPublishing / Streaming-Data-Solutions-with-Amazon-Kinesis
Streaming Data Solutions with Amazon Kinesis, Published by Packt
☆22Updated 4 years ago