garystafford / streaming-sales-generator
Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python
☆44Updated last year
Related projects ⓘ
Alternatives and complementary repositories for streaming-sales-generator
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆70Updated 3 years ago
- Delta Lake examples☆207Updated last month
- ☆32Updated 6 months ago
- Delta Lake Documentation☆46Updated 5 months ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆167Updated last year
- ☆66Updated last month
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆83Updated 6 months ago
- ☆48Updated 8 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆163Updated last week
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset.☆41Updated 7 months ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆24Updated 8 months ago
- Code snippets for Data Engineering Design Patterns book☆40Updated last week
- Adapter for dbt that executes dbt pipelines on Apache Flink☆84Updated 8 months ago
- A Table format agnostic data sharing framework☆38Updated 9 months ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Updated 2 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆65Updated 2 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆111Updated 4 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆189Updated last week
- Simple stream processing pipeline☆92Updated 5 months ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆56Updated last year
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆76Updated last week
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work☆47Updated 2 years ago
- ☆13Updated 9 months ago
- ☆43Updated 3 months ago
- Cloned by the `dbt init` task☆59Updated 6 months ago
- ☆13Updated last year