Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
☆2,533Jun 26, 2026Updated this week
Alternatives and similar repositories for hamilton
Users that are interested in hamilton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆2,431Jun 22, 2026Updated last week
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆860Jul 3, 2023Updated 2 years ago
- A light-weight, flexible, and expressive statistical data testing library☆4,386Updated this week
- the portable Python dataframe library☆6,584Updated this week
- An orchestration platform for the development, production, and observation of data assets.☆15,733Jun 18, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆3,151Jun 22, 2026Updated last week
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,625May 29, 2025Updated last year
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,727Updated this week
- High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale☆5,571Jun 21, 2026Updated last week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆22,701Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,166May 19, 2026Updated last month
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️☆5,507Jun 22, 2026Updated last week
- Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.☆7,119Updated this week
- A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with gi…☆21,610Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python Stream Processing☆2,026Jun 20, 2026Updated last week
- A lightweight logger for machine learning teams to log images and predictions in production.☆154May 3, 2023Updated 3 years ago
- Simple Workflow Framework based on Hamilton☆24May 2, 2026Updated last month
- Lightweight and extensible compatibility layer between dataframe libraries!☆1,666Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,903Updated this week
- Build, Manage and Deploy AI/ML Systems☆10,143Updated this week
- Always know what to expect from your data.☆11,603Updated this week
- Python SQL Parser and Transpiler☆9,352Jun 22, 2026Updated last week
- Turning PySpark Into a Universal DataFrame API☆522Jun 18, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Postgres-native columnar storage extension☆3,030Feb 10, 2025Updated last year
- The Open Source Feature Store for AI/ML☆7,102Jun 22, 2026Updated last week
- an ambient intelligence library☆6,174May 12, 2026Updated last month
- Business intelligence as code: build fast, interactive data visualizations in SQL and markdown☆6,493Feb 18, 2026Updated 4 months ago
- Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.☆2,547Jun 22, 2026Updated last week
- EZQL ask your database questions using natural language.☆134Jan 17, 2025Updated last year
- 💸 The Map3 Supercharge SDK connects crypto apps to Wallets, Exchanges & Bridges, enabling cross-chain deposits and increasing volumes.☆99Jun 23, 2023Updated 3 years ago
- UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured ch…☆2,352Aug 18, 2024Updated last year
- Extremely fast Query Engine for DataFrames, written in Rust☆38,879Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.☆10,707Updated this week
- BuildFlow, is an open source framework for building large scale systems using Python. All you need to do is describe where your input is …☆200Jan 10, 2024Updated 2 years ago
- Modin: Scale your Pandas workflows by changing a single line of code☆10,391Feb 10, 2026Updated 4 months ago
- A data modelling layer built on top of polars and pydantic☆629May 8, 2026Updated last month
- structured outputs for llms☆13,210Updated this week
- Structured Outputs☆13,984Jun 19, 2026Updated last week
- Malloy is a modern open source language for describing data relationships and transformations.☆2,508Updated this week