dagster-io / dagster
An orchestration platform for the development, production, and observation of data assets.
☆13,089Updated this week
Alternatives and similar repositories for dagster:
Users that are interested in dagster are comparing it to the libraries listed below
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆10,790Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆19,230Updated this week
- Always know what to expect from your data.☆10,376Updated this week
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️☆3,567Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak…☆18,102Updated this week
- Build, Manage and Deploy AI/ML Systems☆8,776Updated this week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,293Updated this week
- the portable Python dataframe library☆5,731Updated this week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io☆2,083Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr…☆2,053Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,566Updated 2 weeks ago
- Fastest library to load data from DB to DataFrames in Rust and Python☆2,245Updated this week
- Dataframes powered by a multithreaded, vectorized query engine, written in Rust☆33,563Updated this week
- Distributed data engine for Python/SQL designed for the cloud, powered by Rust☆2,808Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆39,979Updated this week
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code.☆8,849Updated this week
- A light-weight, flexible, and expressive statistical data testing library☆3,788Updated this week
- DuckDB is an analytical in-process SQL database management system☆28,998Updated this week
- Python SQL Parser and Transpiler☆7,670Updated this week
- The Metadata Platform for your Data and AI Stack☆10,575Updated this week
- 🧙 Build, run, and manage data pipelines for integrating and transforming data.☆8,310Updated this week
- re_data - fix data issues before your users & CEO would discover them 😊☆1,563Updated last year
- 📚 Parameterize, execute, and analyze notebooks☆6,153Updated last month
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.☆6,218Updated this week
- The Open Source Feature Store for AI/ML☆6,041Updated this week
- A native Rust library for Delta Lake, with bindings into Python☆2,731Updated this week
- Self-serve BI to 10x your data team ⚡️☆4,702Updated this week
- Compare tables within or across databases☆2,969Updated 11 months ago
- Build data pipelines, the easy way 🛠️☆4,119Updated last year
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,078Updated last month