PacktPublishing / Managing-Data-as-a-Product
Managing Data as a Product, published by Packt
☆15Updated 4 months ago
Alternatives and similar repositories for Managing-Data-as-a-Product:
Users that are interested in Managing-Data-as-a-Product are comparing it to the libraries listed below
- ☆34Updated 11 months ago
- Data product portal created by Dataminded☆183Updated this week
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆23Updated last year
- Streaming demo dbt☆17Updated 7 months ago
- ☆29Updated 9 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆111Updated 3 weeks ago
- A portable Datamart and Business Intelligence suite built with Docker, sqlmesh + dbtcore, DuckDB and Superset☆49Updated 5 months ago
- Fabric Python Notebooks examples☆68Updated 2 weeks ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆45Updated 2 months ago
- PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows☆43Updated 9 months ago
- ☆15Updated last year
- A DataOps framework for building a lakehouse.☆50Updated last week
- Ready-to-use code snippets for building interactive data applications using Databricks Apps.☆75Updated last week
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆141Updated 3 weeks ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Content published on social channels☆17Updated 2 weeks ago
- The Data Product Specification☆10Updated 2 months ago
- Open Data Product Specification with examples☆9Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆51Updated 8 months ago
- The Data Product Descriptor Specification (DPDS) Repository☆77Updated 3 months ago
- A python SPark ETL libRary (SPETLR) for Databricks. https://discord.gg/p9bzqGybVW☆21Updated last month
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆49Updated last week
- Personal project for setting up an open source data warehouse.☆29Updated 2 months ago
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆27Updated 3 months ago
- ☆28Updated last month
- example of a Microsoft Fabric Solution☆31Updated 8 months ago
- A platform to manage the data product life cycle☆16Updated last week
- csv and flat-file sniffer built in Rust.☆42Updated last year
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆27Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago