PrefectHQ / prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
☆19,802 · Updated this week
Alternatives and similar repositories for prefect
Users who are interested in prefect are comparing it to the libraries listed below.
- An orchestration platform for the development, production, and observation of data assets. ☆13,567 · Updated last week
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application… ☆11,109 · Updated this week
- Always know what to expect from your data. ☆10,574 · Updated this week
- Build, Manage and Deploy AI/ML Systems ☆8,966 · Updated this week
- the portable Python dataframe library ☆5,923 · Updated this week
- The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lak… ☆18,871 · Updated this week
- A modular SQL linter and auto-formatter with support for multiple dialects and templated code. ☆9,020 · Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and… ☆10,438 · Updated this week
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks. ☆6,366 · Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows ☆41,024 · Updated this week
- Parallel computing with task scheduling ☆13,337 · Updated last week
- Build data pipelines, the easy way 🛠️ ☆4,131 · Updated 2 years ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,374 · Updated 2 months ago
- 🧙 Build, run, and manage data pipelines for integrating and transforming data. ☆8,419 · Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,409 · Updated 9 months ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ ☆3,594 · Updated last month
- Dolt – Git for Data ☆18,891 · Updated this week
- A light-weight, flexible, and expressive statistical data testing library ☆3,916 · Updated this week
- DuckDB is an analytical in-process SQL database management system ☆31,018 · Updated this week
- 🦉 Data Versioning and ML Experiments ☆14,672 · Updated this week
- Python Stream Processing ☆6,807 · Updated 11 months ago
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,223 · Updated last week
- Voilà turns Jupyter notebooks into standalone web applications ☆5,757 · Updated last week
- Streamlit — A faster way to build and share data apps. ☆40,409 · Updated this week
- An open source multi-tool for exploring and publishing data ☆10,189 · Updated last month
- A modern Python package and dependency manager supporting the latest PEP standards ☆8,422 · Updated last week
- A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with gi… ☆14,318 · Updated this week
- Panel: The powerful data exploration & web app framework for Python ☆5,301 · Updated last week
- The Open Source Feature Store for AI/ML ☆6,215 · Updated this week
- Python composable command line interface toolkit ☆16,640 · Updated this week