garystafford / streaming-sales-generator
Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python
☆44 Updated 2 years ago
Alternatives and similar repositories for streaming-sales-generator
Users who are interested in streaming-sales-generator are comparing it to the libraries listed below.
- Resources for video demonstrations and blog posts related to DataOps on AWS ☆182 Updated 3 years ago
- Delta Lake examples ☆231 Updated last year
- Simple stream processing pipeline ☆110 Updated last year
- Code snippets for Data Engineering Design Patterns book ☆271 Updated 8 months ago
- Pyspark boilerplate for running prod ready data pipeline ☆29 Updated 4 years ago
- Delta Lake Documentation ☆50 Updated last year
- A repository of sample code to accompany our blog post on Airflow and dbt. ☆181 Updated 2 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆77 Updated 4 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆38 Updated 4 years ago
- ☆269 Updated last year
- Code for dbt tutorial ☆164 Updated 2 months ago
- New generation opensource data stack ☆75 Updated 3 years ago
- New Generation Opensource Data Stack Demo ☆449 Updated 2 years ago
- Execution of DBT models using Apache Airflow through Docker Compose ☆124 Updated 2 years ago
- Materials for the next course ☆25 Updated 2 years ago
- ☆16 Updated last year
- ☆104 Updated 10 months ago
- Data pipeline with dbt, Airflow, Great Expectations ☆165 Updated 4 years ago
- Quick Guides from Dremio on Several topics ☆79 Updated last month
- Amazon EMR Serverless and Amazon MSK Serverless Demo ☆13 Updated 3 years ago
- Template for a data contract used in a data mesh. ☆480 Updated last year
- A curated list of awesome blogs, videos, tools and resources about Data Contracts ☆180 Updated last year
- Cloned by the `dbt init` task ☆62 Updated last year
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools ☆14 Updated 3 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR ☆67 Updated 3 years ago
- Example code for running Spark and Hive jobs on EMR Serverless. ☆168 Updated 10 months ago
- Apache Flink (Pyflink) and Related Projects ☆42 Updated 7 months ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0 ☆69 Updated 3 years ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆253 Updated 2 months ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆273 Updated last month