sodadata / soda-streaming
☆23Updated 3 years ago
Alternatives and similar repositories for soda-streaming:
Users that are interested in soda-streaming are comparing it to the libraries listed below
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- An open specification for data products in Data Mesh☆59Updated 6 months ago
- Make simple storing test results and visualisation of these in a BI dashboard☆44Updated last month
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆43Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da…☆111Updated last year
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆119Updated 3 months ago
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- Example Dagster Cloud code for the Hooli Data Engineering organization.☆1Updated this week
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Library to convert DBT manifest metadata to Airflow tasks☆48Updated last year
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated 2 weeks ago
- Utility functions for dbt projects running on Spark☆33Updated 2 months ago
- Rules based grant management for Snowflake☆40Updated 6 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated last week
- Weekly Data Engineering Newsletter☆94Updated 9 months ago
- Evaluation Matrix for Change Data Capture☆25Updated 9 months ago
- Test all the data☆37Updated last year
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- ☆21Updated 3 years ago
- Python package for querying iceberg data through duckdb.☆67Updated last year
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆108Updated this week
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes.☆24Updated 6 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 2 years ago
- dbt ksqlDB adapter☆27Updated 2 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆172Updated last year
- Unity Catalog UI☆40Updated 8 months ago
- A Table format agnostic data sharing framework☆38Updated last year