apache/hamilton

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apache/hamilton)

apache / hamilton

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

☆2,552

Alternatives and similar repositories for hamilton

Users that are interested in hamilton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

stitchfix / hamilton
View on GitHub
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
☆860Jul 3, 2023Updated 3 years ago
apache / burr
View on GitHub
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…
☆2,485Updated this week
unionai-oss / pandera
View on GitHub
A light-weight, flexible, and expressive statistical data testing library
☆4,409Updated this week
ibis-project / ibis
View on GitHub
the portable Python dataframe library
☆6,601Updated this week
dagster-io / dagster
View on GitHub
An orchestration platform for the development, production, and observation of data assets.
☆15,870Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ploomber / ploomber
View on GitHub
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
☆3,623May 29, 2025Updated last year
fugue-project / fugue
View on GitHub
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…
☆2,170May 19, 2026Updated 2 months ago
SQLMesh / sqlmesh
View on GitHub
Scalable and efficient data transformation framework - backwards compatible with dbt.
☆3,208Updated this week
Eventual-Inc / Daft
View on GitHub
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
☆5,644Updated this week
PrefectHQ / prefect
View on GitHub
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
☆23,437Updated this week
flyteorg / flyte
View on GitHub
Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.
☆7,146Updated this week
lance-format / lance
View on GitHub
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…
☆6,825Updated this week
narwhals-dev / narwhals
View on GitHub
Lightweight and extensible compatibility layer between dataframe libraries!
☆1,685Updated this week
dlt-hub / dlt
View on GitHub
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
☆5,632Updated this week
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
thebabylonai / babylog
View on GitHub
A lightweight logger for machine learning teams to log images and predictions in production.
☆154May 3, 2023Updated 3 years ago
bytewax / bytewax
View on GitHub
Python Stream Processing
☆2,036Jun 20, 2026Updated last month
marimo-team / marimo
View on GitHub
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with gi…
☆21,919Updated this week
legout / flowerpower
View on GitHub
Simple Workflow Framework based on Hamilton
☆24Jul 13, 2026Updated last week
kedro-org / kedro
View on GitHub
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…
☆10,926Updated this week
Netflix / metaflow
View on GitHub
Build, Manage and Deploy AI/ML Systems
☆10,188Updated this week
fivetran / great_expectations
View on GitHub
Always know what to expect from your data.
☆11,658Updated this week
outerbase / ezql
View on GitHub
EZQL ask your database questions using natural language.
☆135Jan 17, 2025Updated last year
uptrain-ai / uptrain
View on GitHub
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured ch…
☆2,355Aug 18, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tobymao / sqlglot
View on GitHub
Python SQL Parser and Transpiler
☆9,439Updated this week
eakmanrq / sqlframe
View on GitHub
Turning PySpark Into a Universal DataFrame API
☆526Updated this week
feast-dev / feast
View on GitHub
The Open Source Feature Store for AI/ML
☆7,142Updated this week
evidence-dev / evidence
View on GitHub
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
☆6,716Feb 18, 2026Updated 5 months ago
hydradatabase / columnar
View on GitHub
Postgres-native columnar storage extension
☆3,036Feb 10, 2025Updated last year
PrefectHQ / marvin
View on GitHub
an ambient intelligence library
☆6,180Updated this week
pola-rs / polars
View on GitHub
Extremely fast Query Engine for DataFrames, written in Rust
☆39,061Updated this week
map3xyz / supercharge
View on GitHub
💸 The Map3 Supercharge SDK connects crypto apps to Wallets, Exchanges & Bridges, enabling cross-chain deposits and increasing volumes.
☆99Jun 23, 2023Updated 3 years ago
griptape-ai / griptape
View on GitHub
Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.
☆2,559Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
modin-project / modin
View on GitHub
Modin: Scale your Pandas workflows by changing a single line of code
☆10,395Feb 10, 2026Updated 5 months ago
JakobGM / patito
View on GitHub
A data modelling layer built on top of polars and pydantic
☆633May 8, 2026Updated 2 months ago
launchflow / buildflow
View on GitHub
BuildFlow, is an open source framework for building large scale systems using Python. All you need to do is describe where your input is …
☆200Jan 10, 2024Updated 2 years ago
mage-ai / mage-ai
View on GitHub
🧙 Build, run, and manage data pipelines for integrating and transforming data.
☆8,772Updated this week
567-labs / instructor
View on GitHub
structured outputs for llms
☆13,579Jul 13, 2026Updated last week
malloydata / malloy
View on GitHub
Malloy is a modern open source language for describing data relationships and transformations.
☆2,522Updated this week
stanfordnlp / dspy
View on GitHub
DSPy: The framework for programming—not prompting—language models
☆36,252Updated this week