A DataOps framework for building a lakehouse.
☆56Mar 7, 2026Updated this week
Alternatives and similar repositories for laktory
Users that are interested in laktory are comparing it to the libraries listed below
Sorting:
- PySpark schema generator☆44Feb 23, 2023Updated 3 years ago
- ☆18Aug 6, 2024Updated last year
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- ☆15May 31, 2023Updated 2 years ago
- Configure and enforce conventions for your dbt project.☆93Updated this week
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated last year
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆282Updated this week
- A configuration-driven framework for building Dagster pipelines that enables teams to create and manage data workflows using YAML/JSON in…☆38Nov 13, 2024Updated last year
- ☆30Dec 4, 2024Updated last year
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆59Mar 29, 2023Updated 2 years ago
- Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.☆187Mar 1, 2026Updated last week
- A ✨blazingly fast✨ builder for Azure Functions powered by esbuild.☆12Jun 18, 2025Updated 8 months ago
- An Azure Data Factory pipeline with T-SQL and REST API to backup an Azure Synapse provisioned SQL Pool to Azure BLOB storage.☆14Dec 7, 2020Updated 5 years ago
- Spark-free Python utilities for Microsoft Fabric focused on Data Engineering using Polars and delta-rs☆42Feb 23, 2026Updated 2 weeks ago
- OPI5 open micro desk design.☆13Mar 6, 2023Updated 3 years ago
- QGIS 3 plugin☆11Jan 3, 2021Updated 5 years ago
- This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The fictici…☆14Sep 30, 2024Updated last year
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆46Jan 27, 2025Updated last year
- Python interface to the FDIC's API for publically available bank data☆12Apr 15, 2023Updated 2 years ago
- Utility functions for dbt projects running on Athena☆12Mar 25, 2025Updated 11 months ago
- A unit test framework for Databricks notebooks☆12Dec 8, 2020Updated 5 years ago
- Smart Tensors Tutorials☆10Oct 19, 2025Updated 4 months ago
- ☆12Sep 18, 2025Updated 5 months ago
- Data Package of ratification status of the Paris Climate Agreement and the emissions shares used for entry into force☆14Feb 13, 2023Updated 3 years ago
- Raspberry Pi 4 Image☆12Oct 25, 2024Updated last year
- ☆10Aug 22, 2021Updated 4 years ago
- ☆12Jul 8, 2025Updated 8 months ago
- Examples on how to use terrabyte for Earth Observation datasets☆11Dec 11, 2025Updated 2 months ago
- A php library for working with Data Package.☆10Jan 30, 2026Updated last month
- 🏃♀️ Minimalist SQL orchestrator☆307Updated this week
- Starlight plugin to quickly and easily document keyboard shortcuts☆15Nov 26, 2025Updated 3 months ago
- Sphero SDK to run on Arduino using C++☆11Dec 21, 2019Updated 6 years ago
- All the useful tools I have been using while working in data science for remote sensing☆11Nov 27, 2019Updated 6 years ago
- ☆16Jul 25, 2025Updated 7 months ago
- Modern utility library and typescript typings for building JSON Schema documents☆14Nov 28, 2025Updated 3 months ago
- Typed, annotated vectors for well-documented datasets☆11Jan 30, 2026Updated last month
- Info about the Nashville tech community☆11Mar 25, 2025Updated 11 months ago
- Nuance Mix Demo Client for use with Azure Static Web Apps☆14Nov 30, 2023Updated 2 years ago
- Camtraptor is an R package to read, explore and visualize Camera Trap Data Packages (Camtrap DP)☆13Updated this week