TrivadisPF / platys-modern-data-platform
Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....
☆75Updated this week
Alternatives and similar repositories for platys-modern-data-platform:
Users that are interested in platys-modern-data-platform are comparing it to the libraries listed below
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆141Updated 3 weeks ago
- Sample configuration to deploy a modern data platform.☆88Updated 3 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆74Updated 3 years ago
- Yet Another (Spark) ETL Framework☆20Updated last year
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆232Updated 3 weeks ago
- Quick Guides from Dremio on Several topics☆70Updated 3 months ago
- ☆263Updated 6 months ago
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker☆52Updated 2 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆94Updated last week
- Unity Catalog UI☆40Updated 7 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆98Updated 2 years ago
- A tool that makes it easy to run modular Trino environments locally.☆37Updated this week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆121Updated this week
- Data Tools Subjective List☆83Updated last year
- Data product portal created by Dataminded☆183Updated this week
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks☆32Updated this week
- New generation opensource data stack☆67Updated 2 years ago
- A Table format agnostic data sharing framework☆38Updated last year
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆61Updated 2 years ago
- A simple Spark-powered ETL framework that just works 🍺☆181Updated 3 weeks ago
- Utility functions for dbt projects running on Trino☆21Updated last year
- Delta Lake Documentation☆49Updated 10 months ago
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆90Updated this week
- Schema modelling framework for decentralised domain-driven ownership of data.☆252Updated last year
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 2 years ago
- An open specification for data products in Data Mesh☆56Updated 5 months ago
- Storage connector for Trino☆110Updated this week
- Apache Hive Metastore as a Standalone server in Docker☆73Updated 8 months ago