garystafford / streaming-sales-generator
Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python
☆44 Updated 3 years ago
Alternatives and similar repositories for streaming-sales-generator
Users that are interested in streaming-sales-generator are comparing it to the libraries listed below
- Resources for video demonstrations and blog posts related to DataOps on AWS ☆182 Updated 3 years ago
- Simple stream processing pipeline ☆110 Updated last year
- Delta Lake examples ☆235 Updated last year
- Code for dbt tutorial ☆165 Updated 3 months ago
- Delta Lake Documentation ☆51 Updated last year
- ☆107 Updated 11 months ago
- Code snippets for Data Engineering Design Patterns book ☆302 Updated 2 weeks ago
- A repository of sample code to accompany our blog post on Airflow and dbt. ☆182 Updated 2 years ago
- A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment. ☆38 Updated 4 years ago
- build dw with dbt ☆50 Updated last year
- Execution of DBT models using Apache Airflow through Docker Compose ☆126 Updated 2 years ago
- ☆269 Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆77 Updated 4 years ago
- Cloned by the `dbt init` task ☆62 Updated last year
- Template for a data contract used in a data mesh. ☆486 Updated last year
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic. ☆91 Updated last year
- Data pipeline with dbt, Airflow, Great Expectations ☆165 Updated 4 years ago
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines ☆135 Updated 3 years ago
- Docker Airflow - Contains a docker compose file for Airflow 2.0 ☆69 Updated 3 years ago
- Materials of the Official Helm Chart Webinar ☆27 Updated 4 years ago
- Docker with Airflow and Spark standalone cluster ☆262 Updated 2 years ago
- Pyspark boilerplate for running prod ready data pipeline ☆29 Updated 4 years ago
- Data Engineering with Spark and Delta Lake ☆106 Updated 2 years ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) ☆251 Updated 2 weeks ago
- New Generation Opensource Data Stack Demo ☆454 Updated 2 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow ☆34 Updated 5 years ago
- Spark on Kubernetes using Helm ☆33 Updated 5 years ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format. ☆169 Updated 3 months ago
- Materials for the next course ☆25 Updated 2 years ago
- ☆92 Updated 10 months ago