Pythonic Programming Framework to orchestrate jobs in Databricks Workflow
☆227Jun 8, 2026Updated this week
Alternatives and similar repositories for brickflow
Users that are interested in brickflow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python Library to support running data quality rules while the spark job is running⚡☆202May 19, 2026Updated 3 weeks ago
- Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipeli…☆651May 6, 2026Updated last month
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆287Jun 3, 2026Updated last week
- Delta Lake helper methods in PySpark☆329Jan 19, 2026Updated 4 months ago
- PySpark test helper methods with beautiful error messages☆769May 20, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- pyspark methods to enhance developer productivity 📣 👯 🎉☆687Mar 6, 2025Updated last year
- Notebooks to learn Databricks Lakehouse Platform☆44Updated this week
- Notebook Discovery Tool for Databricks notebooks☆19Jul 14, 2022Updated 3 years ago
- ✨ A Pydantic to PySpark schema library☆127May 24, 2026Updated 3 weeks ago
- Testing framework for Databricks notebooks☆316Apr 20, 2024Updated 2 years ago
- ☆18Aug 6, 2024Updated last year
- A web application for creating and managing Databricks cluster policies with an interactive UI, allowing users to configure policy attrib…☆17May 7, 2025Updated last year
- csv and flat-file sniffer built in Rust.☆45Jan 26, 2024Updated 2 years ago
- Don't Panic. This guide will help you when it feels like the end of the world.☆31Feb 7, 2026Updated 4 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Open, Multi-modal Catalog for Data & AI☆3,417Jun 5, 2026Updated last week
- ☆10May 24, 2022Updated 4 years ago
- Custom PySpark Connectors☆100Mar 3, 2026Updated 3 months ago
- Code snippets used in demos recorded for the blog.☆42Apr 30, 2026Updated last month
- A library that provides useful extensions to Apache Spark and PySpark.☆238Jun 5, 2026Updated last week
- A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.☆27Mar 25, 2024Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆46Jan 27, 2024Updated 2 years ago
- Git Repo for EDW Best Practice Assets on the Lakehouse☆16Dec 11, 2023Updated 2 years ago
- Delta Lake helper methods. No Spark dependency.☆22Jan 19, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A simple and easy to use Data Quality (DQ) tool built with Python.☆51Sep 7, 2023Updated 2 years ago
- Spark style guide☆270Sep 30, 2024Updated last year
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆45Jan 27, 2025Updated last year
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 11 months ago
- Delta Lake Documentation☆53Jun 19, 2024Updated last year
- Delta lake and filesystem helper methods☆51Feb 29, 2024Updated 2 years ago
- A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran☆25Apr 17, 2026Updated last month
- Apache Spark Connect Client for Rust☆116Jun 10, 2025Updated last year
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks☆57Jul 4, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Column-wise type annotations for pyspark DataFrames☆107Jun 2, 2026Updated last week
- SQL Queries & Alerts for Databricks System Tables access.audit Logs☆50Apr 28, 2026Updated last month
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated 2 years ago
- SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.☆26Feb 22, 2025Updated last year
- ☆17Oct 12, 2023Updated 2 years ago
- Furnace is a high-performance quantitative trading library that provides features similar to CCXT, allowing developers to connect and int…☆16Jan 16, 2025Updated last year
- Turning PySpark Into a Universal DataFrame API☆510Updated this week