dagster-io/dagster

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dagster-io/dagster)

dagster-io / dagster

An orchestration platform for the development, production, and observation of data assets.

☆15,901

Alternatives and similar repositories for dagster

Users that are interested in dagster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PrefectHQ / prefect
View on GitHub
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
☆23,490Updated this week
dbt-labs / dbt-core
View on GitHub
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…
☆13,516Updated this week
airbytehq / airbyte
View on GitHub
Open-source data movement for ELT pipelines and AI agents — from APIs, databases & files to warehouses, lakes, and AI applications. Both …
☆21,702Updated this week
fivetran / great_expectations
View on GitHub
Always know what to expect from your data.
☆11,670Updated this week
apache / airflow
View on GitHub
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
☆46,258Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pola-rs / polars
View on GitHub
Extremely fast Query Engine for DataFrames, written in Rust
☆39,109Updated this week
dlt-hub / dlt
View on GitHub
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
☆5,666Updated this week
mage-ai / mage-ai
View on GitHub
🧙 Build, run, and manage data pipelines for integrating and transforming data.
☆8,779Jul 17, 2026Updated last week
duckdb / duckdb
View on GitHub
DuckDB is an analytical in-process SQL database management system
☆39,724Updated this week
flyteorg / flyte
View on GitHub
Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.
☆7,149Updated this week
SQLMesh / sqlmesh
View on GitHub
Scalable and efficient data transformation framework - backwards compatible with dbt.
☆3,223Updated this week
kedro-org / kedro
View on GitHub
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…
☆10,935Updated this week
Netflix / metaflow
View on GitHub
Build, Manage and Deploy AI/ML Systems
☆10,196Updated this week
ibis-project / ibis
View on GitHub
the portable Python dataframe library
☆6,610Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
amundsen-io / amundsen
View on GitHub
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…
☆4,780Jul 1, 2026Updated 3 weeks ago
apache / superset
View on GitHub
Apache Superset is a Data Visualization and Data Exploration Platform
☆73,995Updated this week
meltano / meltano
View on GitHub
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr…
☆2,570Updated this week
streamlit / streamlit
View on GitHub
Streamlit — A faster way to build and share data apps.
☆45,357Updated this week
datahub-project / datahub
View on GitHub
The Context Platform for your Data and AI Stack
☆12,348Updated this week
sqlfluff / sqlfluff
View on GitHub
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
☆9,827Updated this week
dask / dask
View on GitHub
Parallel computing with task scheduling
☆13,871Updated this week
mlflow / mlflow
View on GitHub
The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, a…
☆27,217Updated this week
ray-project / ray
View on GitHub
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
☆43,356Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cube-js / cube
View on GitHub
📊 Cube Core is open-source semantic layer for AI, BI and embedded analytics
☆20,495Updated this week
tobymao / sqlglot
View on GitHub
Python SQL Parser and Transpiler
☆9,462Updated this week
treeverse / dvc
View on GitHub
🦉 Data Versioning and ML Experiments
☆15,775Updated this week
spotify / luigi
View on GitHub
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…
☆18,752Jul 18, 2026Updated last week
delta-io / delta
View on GitHub
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…
☆8,925Updated this week
evidence-dev / evidence
View on GitHub
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
☆6,774Feb 18, 2026Updated 5 months ago
unionai-oss / pandera
View on GitHub
A light-weight, flexible, and expressive statistical data testing library
☆4,411Jul 18, 2026Updated last week
pydantic / pydantic
View on GitHub
Data validation using Python type hints
☆28,401Updated this week
modin-project / modin
View on GitHub
Modin: Scale your Pandas workflows by changing a single line of code
☆10,393Feb 10, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
metabase / metabase
View on GitHub
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data
☆48,376Updated this week
trinodb / trino
View on GitHub
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
☆13,074Updated this week
astral-sh / ruff
View on GitHub
An extremely fast Python linter and code formatter, written in Rust.
☆48,851Updated this week
feast-dev / feast
View on GitHub
The Open Source Feature Store for AI/ML
☆7,174Updated this week
getredash / redash
View on GitHub
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
☆28,721Jul 9, 2026Updated 2 weeks ago
lightdash / lightdash
View on GitHub
Agentic BI. Analytics at the speed of code ⚡️
☆5,982Updated this week
temporalio / temporal
View on GitHub
Temporal service
☆21,850Updated this week