dagster-io / dagsterLinks
An orchestration platform for the development, production, and observation of data assets.
☆13,335Updated this week
Alternatives and similar repositories for dagster
Users that are interested in dagster are comparing it to the libraries listed below
Sorting:
- the portable Python dataframe library☆5,844Updated this week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆10,965Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆19,505Updated this week
- Build data pipelines, the easy way 🛠️☆4,123Updated 2 years ago
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆40,495Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak…☆18,417Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.☆6,287Updated this week
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code.☆8,921Updated this week
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️☆3,707Updated this week
- Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.☆27,405Updated last week
- Python SQL Parser and Transpiler☆7,841Updated this week
- Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to wr…☆2,101Updated this week
- Business intelligence as code: build fast, interactive data visualizations in SQL and markdown☆5,273Updated last week
- DuckDB is an analytical in-process SQL database management system☆30,169Updated this week
- Always know what to expect from your data.☆10,471Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆15,555Updated this week
- Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.☆2,119Updated this week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,382Updated this week
- 🦉 Data Versioning and ML Experiments☆14,542Updated this week
- lakeFS - Data version control for your data lake | Git for data☆4,715Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆18,334Updated 3 weeks ago
- 🧙 Build, run, and manage data pipelines for integrating and transforming data.☆8,370Updated this week
- Compare tables within or across databases☆2,970Updated last year
- The Open Source Feature Store for AI/ML☆6,125Updated this week
- Python Stream Processing☆6,796Updated 10 months ago
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,591Updated last week
- A light-weight, flexible, and expressive statistical data testing library☆3,847Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,187Updated last week
- Self-serve BI to 10x your data team ⚡️☆4,785Updated this week
- Memray is a memory profiler for Python☆14,022Updated last week