sodadata / soda-streamingLinks
☆23Updated 4 years ago
Alternatives and similar repositories for soda-streaming
Users that are interested in soda-streaming are comparing it to the libraries listed below
Sorting:
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 3 years ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆169Updated 4 months ago
- An open specification for data products in Data Mesh☆63Updated 4 months ago
- Schema modelling framework for decentralised domain-driven ownership of data.☆261Updated 2 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆97Updated this week
- Utility functions for dbt projects running on Spark☆34Updated last month
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated last week
- Terraform Provider for Airbyte API☆62Updated this week
- Yet Another (Spark) ETL Framework☆21Updated 2 years ago
- Make simple storing test results and visualisation of these in a BI dashboard☆52Updated last month
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs …☆160Updated 3 years ago
- ☆23Updated 4 years ago
- Data Product Portal created by Dataminded☆198Updated this week
- Data Tools Subjective List☆89Updated 2 years ago
- A curated list of awesome blogs, videos, tools and resources about Data Contracts☆182Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink☆96Updated last year
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆183Updated last month
- The Data Product Descriptor Specification (DPDS) Repository☆83Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Updated last year
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆268Updated 10 months ago
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆70Updated last month
- Mapping of DWH database tables to business entities, attributes & metrics in Python, with automatic creation of flattened tables☆75Updated 2 years ago
- Unity Catalog UI☆43Updated last year
- Make dbt docs and Apache Superset talk to one another☆155Updated 4 months ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.☆42Updated last year
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆92Updated this week
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆76Updated 4 years ago
- Data Mesh Architecture☆84Updated 3 months ago
- Data Mesh Manager (Community Edition)☆53Updated 3 months ago