datastacktv / apache-beam-batch-processingLinks

Public source code for the Batch Processing with Apache Beam (Python) online course

☆18

Alternatives and similar repositories for apache-beam-batch-processing

Users that are interested in apache-beam-batch-processing are comparing it to the libraries listed below

Sorting:

datastacktv / kubeflow-introduction
Code examples for the Introduction to Kubeflow course
☆14Updated 4 years ago
ssp-data / data-engineering-devops
Full stack data engineering tools and infrastructure set-up
☆57Updated 4 years ago
PacktPublishing / Building-Big-Data-Pipelines-with-Apache-Beam
Building Big Data Pipelines with Apache Beam, published by Packt
☆87Updated 2 years ago
kadnan / Airflow-Tutorial
Basic tutorial of using Apache Airflow
☆36Updated 7 years ago
Aiven-Labs / python-fake-data-producer-for-apache-kafka
The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …
☆85Updated last year
omidvd79 / Big_Data_Demystified
Big Data Demystified meetup and blog examples
☆31Updated last year
monte-carlo-data / data-downtime-challenge
☆93Updated 2 years ago
tuanchris / cloud-data-lake
Data lake, data warehouse on GCP
☆57Updated 3 years ago
shravan-kuchkula / udacity-data-eng-proj-1
Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…
☆89Updated 4 years ago
kislerdm / data-engineering-interviews
Data engineering interviews Q&A for data community by data community
☆64Updated 5 years ago
trallard / airflow-tutorial
🐍💨 Airflow tutorial for PyCon 2019
☆87Updated 2 years ago
benjigoldberg / udacity-airflow
Udacity Data Pipeline Exercises
☆15Updated 5 years ago
marcosmarxm / airflow-testing-ci-workflow
(project & tutorial) dag pipeline tests + ci/cd setup
☆89Updated 4 years ago
vim89 / datapipelines-essentials-python
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…
☆55Updated 2 years ago
dbt-labs / dbt-starter-project
Cloned by the `dbt init` task
☆62Updated last year
jonathandinu / spark-ray-data-science
Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with …
☆52Updated 10 months ago
sizrailev / life-around-data-code
Code snippets and tools published on the blog at lifearounddata.com
☆12Updated 5 years ago
mahdiqb / modern_data_platform
Sample configuration to deploy a modern data platform.
☆89Updated 3 years ago
jacobceles / intro-to-colab-pyspark-emr
A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …
☆20Updated 4 years ago
MrPowers / ceja
PySpark phonetic and string matching algorithms
☆39Updated last year
astronomer / webinar-dag-writing-best-practices
☆48Updated 4 years ago
datastacktv / apache-beam-explained
Source code for the YouTube video, Apache Beam Explained in 12 Minutes
☆21Updated 5 years ago
spbail / data-quality-tools
Content for a talk on "The wonderful world of data quality tools in Python"
☆18Updated 4 years ago
itversity / mastering-postgresql
Content related to Mastering Postgresql along with videos.
☆18Updated 4 years ago
zsvoboda / ngods
New generation opensource data stack
☆75Updated 3 years ago
josephmachado / cost_effective_data_pipelines
Cost Efficient Data Pipelines with DuckDB
☆60Updated 6 months ago
Aiven-Labs / python-notebooks-for-apache-kafka
A Series of Notebooks on how to start with Kafka and Python
☆152Updated 9 months ago
abhishek-ch / streamlit-healthcare-ML-App
Streamlit example showing Scikit Learn & Pyspark ML over Healthcare data ! Its simple !!
☆31Updated 4 years ago
greatexpectationslabs / ge_tutorials
Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.
☆168Updated 2 years ago
adipolak / scaling-machine-learning-course
Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi…
☆23Updated 2 years ago