A DataOps framework for building a lakehouse.
☆57 · May 15, 2026 · Updated this week
Alternatives and similar repositories for laktory
Users that are interested in laktory are comparing it to the libraries listed below.
- ☆18 · Aug 6, 2024 · Updated last year
- Yet Another (Spark) ETL Framework ☆21 · Oct 21, 2023 · Updated 2 years ago
- PySpark schema generator ☆44 · Feb 23, 2023 · Updated 3 years ago
- Manage Unity Catalog tables with Pydantic Models ☆10 · Mar 5, 2025 · Updated last year
- Configure and enforce conventions for your dbt project. ☆109 · Updated this week
- A Python package to help Databricks Unity Catalog users read and query Delta Lake tables with Polars, DuckDB, or PyArrow. ☆27 · Mar 25, 2024 · Updated 2 years ago
- Griffe extension to inject field metadata into mkdocstrings (supports dataclasses, pydantic, attrs, and more) ☆24 · May 4, 2026 · Updated 2 weeks ago
- An Azure Data Factory pipeline with T-SQL and REST API to back up an Azure Synapse provisioned SQL Pool to Azure Blob Storage. ☆14 · Dec 7, 2020 · Updated 5 years ago
- Declarative text-based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines. ☆202 · Updated this week
- ☆30 · Dec 4, 2024 · Updated last year
- A configuration-driven framework for building Dagster pipelines that enables teams to create and manage data workflows using YAML/JSON in… ☆37 · Nov 13, 2024 · Updated last year
- The Lakehouse Engine is a configuration-driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆288 · Mar 4, 2026 · Updated 2 months ago
- A proof of concept of how to integrate Spark Lineage in Azure Purview ☆21 · Mar 16, 2021 · Updated 5 years ago
- Flexible Python package for managing and extending LLM-based agents ☆24 · May 14, 2023 · Updated 3 years ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a… ☆45 · Mar 7, 2024 · Updated 2 years ago
- This is a showcase repository for the multi-genie agent solution ☆24 · Feb 22, 2026 · Updated 2 months ago
- This MATLAB code implements the binary grasshopper optimization algorithm to select features and train with KNN ☆12 · Apr 3, 2019 · Updated 7 years ago
- 🏃♀️ Minimalist SQL orchestrator ☆323 · May 12, 2026 · Updated last week
- A collection of Airflow sample workflows for data processing on AWS ☆12 · Dec 1, 2017 · Updated 8 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake ☆11 · Dec 6, 2025 · Updated 5 months ago
- Delta Lake helper methods in PySpark ☆329 · Jan 19, 2026 · Updated 4 months ago
- Metadata-driven Spark notebooks for loading data in Microsoft Fabric ☆13 · Jul 21, 2024 · Updated last year
- Testing framework for Databricks notebooks ☆315 · Apr 20, 2024 · Updated 2 years ago
- Samples on how to call a REST endpoint using Azure SQL Database ☆37 · Feb 5, 2026 · Updated 3 months ago
- dbt integration for Cube ☆16 · Oct 22, 2025 · Updated 6 months ago
- Easy application configuration with Python ☆11 · Feb 11, 2026 · Updated 3 months ago
- Building event-driven data ingestion pipelines in Azure ☆16 · Apr 27, 2023 · Updated 3 years ago
- open-blazorui is a simple UI for your local LLMs. ☆19 · Feb 20, 2025 · Updated last year
- Integrates Google's reCAPTCHA into October. ☆14 · Apr 24, 2024 · Updated 2 years ago
- Repo for demo code (SQL, PowerShell, Docker, etc.) ☆12 · Nov 18, 2022 · Updated 3 years ago
- Discord integration for Livebook ☆17 · Jan 20, 2023 · Updated 3 years ago
- The simple, fast, and scalable code generator that lives in your project. ☆14 · Sep 3, 2025 · Updated 8 months ago
- Click-to-component functionality for LiveView apps. ☆13 · Oct 18, 2023 · Updated 2 years ago
- dbt module for myBI connect ☆13 · Jan 31, 2023 · Updated 3 years ago
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu… ☆60 · Mar 29, 2023 · Updated 3 years ago
- ☆15 · May 18, 2022 · Updated 4 years ago
- Telegram bot using GPT4 API ☆15 · Aug 26, 2024 · Updated last year
- OCB, the Bootstrapper and Apache docker container for October CMS ☆12 · Jun 25, 2024 · Updated last year
- Parsing Module of Microsoft SQL Server Transaction log ☆12 · May 12, 2023 · Updated 3 years ago