ant-laz / streamingworkshop
Step by step development of a streaming pipeline in Python
☆13 Updated 2 years ago
Alternatives and similar repositories for streamingworkshop
Users that are interested in streamingworkshop are comparing it to the libraries listed below
- ☆144 Updated last year
- ☆184 Updated 2 months ago
- Repository for Beam College sessions ☆111 Updated 4 years ago
- ☆130 Updated last year
- BigQuery DataFrames (also known as BigFrames) ☆270 Updated this week
- Solution Accelerators for Serverless Spark on GCP, the industry's first auto-scaling and serverless Spark as a service ☆74 Updated last year
- Fraudfinder: A comprehensive lab series on how to build a real-time fraud detection system on Google Cloud ☆237 Updated 8 months ago
- ☆282 Updated last year
- An end to end demo of Google's Cloud data and analytic stack. ☆274 Updated 3 weeks ago
- Data Engineering with Google Cloud Platform, published by Packt ☆118 Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆60 Updated last year
- ☆120 Updated 4 months ago
- Use the "develop" branch ☆34 Updated last year
- Apache Beam Python examples and templates. ☆14 Updated 2 years ago
- Data Quality Engine for BigQuery ☆278 Updated 6 months ago
- Code snippets for Data Engineering Design Patterns book ☆275 Updated 8 months ago
- Cloud-native, data onboarding architecture for Google Cloud Datasets ☆167 Updated last month
- Data Engineering on Google Cloud Platform ☆378 Updated last year
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer ☆274 Updated 4 months ago
- Source code accompanying: BigQuery: The Definitive Guide by Lakshmanan & Tigani to be published by O'Reilly Media ☆546 Updated last year
- ☆68 Updated 2 weeks ago
- Just starting your DE journey or along the way already? I will be sharing a short list of DATA-ENGINEERING-CENTRED books that covers the… ☆34 Updated 3 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆168 Updated 2 years ago
- The Dataflow Solution Guides offer full end-to-end deployment for the most common streaming solutions to run on Dataflow. ☆40 Updated last month
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic. ☆90 Updated last year
- ☆11 Updated 2 years ago
- Data pipeline with dbt, Airflow, Great Expectations ☆165 Updated 4 years ago
- An end-to-end example of MLOps on Google Cloud using TensorFlow, TFX, and Vertex AI ☆406 Updated last year
- ☆39 Updated last week
- ☆54 Updated 2 years ago