agile-lab-dev / witboost-starter-kitLinks

Witboost is a versatile platform that addresses a wide range of sophisticated data engineering challenges. The Starter Kit showcases the integration capabilities and provides a "batteries-included" product.

☆25

Alternatives and similar repositories for witboost-starter-kit

Users that are interested in witboost-starter-kit are comparing it to the libraries listed below

Sorting:

mikulskibartosz / check-engine
Data validation library for PySpark 3.0.0
☆33Updated 2 years ago
agile-lab-dev / Data-Product-Specification
An open specification for data products in Data Mesh
☆63Updated 3 weeks ago
JacekMajchrzak / awesome-datamesh
☆97Updated 2 years ago
mrpowers-io / jodie
Delta lake and filesystem helper methods
☆51Updated last year
Nike-Inc / brickflow
Pythonic Programming Framework to orchestrate jobs in Databricks Workflow
☆218Updated 2 weeks ago
delta-io / delta-examples
Delta Lake examples
☆229Updated last year
rajagurunath / lakehouse-sharing
A Table format agnostic data sharing framework
☆39Updated last year
StabRise / spark-pdf
PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it
☆75Updated 5 months ago
victorcouste / trino-dbt-demo
Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database
☆76Updated 4 years ago
souvik-databricks / dlt-with-debug
A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT …
☆49Updated 2 years ago
Nike-Inc / spark-expectations
A Python Library to support running data quality rules while the spark job is running⚡
☆189Updated this week
sibytes / yetl
Yet Another (Spark) ETL Framework
☆21Updated last year
yaooqinn / itachi
A library that brings useful functions from various modern database management systems to Apache Spark
☆60Updated 2 years ago
AbePabbathi / lakehouse-tacklebox
This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.
☆46Updated 8 months ago
MrPowers / mack
Delta Lake helper methods in PySpark
☆324Updated last year
mrpowers-io / spark-style-guide
Spark style guide
☆263Updated last year
dbt-labs / dbt-learn-group-training
The go to demo for public and private dbt Learn
☆80Updated 6 months ago
SemyonSinchenko / flake8-pyspark-with-column
A flake8 plugin that detects of usage withColumn in a loop or inside reduce
☆28Updated 3 months ago
MaterializeInc / mz-hack-day-2022
Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!
☆60Updated 3 years ago
dimajix / flowman
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…
☆96Updated 2 weeks ago
TrivadisPF / platys-modern-data-platform
Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....
☆77Updated this week
data-catering / data-caterer
Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.
☆69Updated last week
jaceklaskowski / spark-delta-lake-workshop
Spark and Delta Lake Workshop
☆22Updated 3 years ago
kaxil / airflowctl
A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects
☆223Updated 5 months ago
starlake-ai / starlake
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
☆154Updated this week
conveyordata / data-product-portal
Data product portal created by Dataminded
☆192Updated this week
delta-io / delta-docs
Delta Lake Documentation
☆50Updated last year
BauplanLabs / no-jvm-wap-with-iceberg
A write-audit-publish implementation on a data lake without the JVM
☆46Updated last year
dbt-labs / spark-utils
Utility functions for dbt projects running on Spark
☆33Updated 8 months ago
opendatamesh-initiative / odm-specification-dpdescriptor
The Data Product Descriptor Specification (DPDS) Repository
☆80Updated 9 months ago