kedro-org / kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
☆10,243Updated this week
Alternatives and similar repositories for kedro:
Users that are interested in kedro are comparing it to the libraries listed below
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,559Updated 6 months ago
- Always know what to expect from your data.☆10,297Updated this week
- Build, Manage and Deploy AI/ML Systems☆8,689Updated this week
- Open source platform for the machine learning lifecycle☆19,972Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,360Updated 5 months ago
- An orchestration platform for the development, production, and observation of data assets.☆12,826Updated this week
- 📚 Parameterize, execute, and analyze notebooks☆6,115Updated 2 months ago
- A light-weight, flexible, and expressive statistical data testing library☆3,720Updated this week
- Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro…☆5,950Updated this week
- ♾️ CML - Continuous Machine Learning | CI/CD for ML☆4,086Updated this week
- Visualise your Kedro data and machine-learning pipelines and track your experiments.☆707Updated this week
- ZenML 🙏: The bridge between ML and Ops. https://zenml.io.☆4,500Updated this week
- 🦉 Data Versioning and ML Experiments☆14,324Updated last week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆18,797Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,494Updated this week
- Modin: Scale your Pandas workflows by changing a single line of code☆10,095Updated this week
- the portable Python dataframe library☆5,648Updated this week
- Parallel computing with task scheduling☆13,083Updated last week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆12,816Updated last week
- Clean APIs for data cleaning. Python implementation of R package Janitor☆1,406Updated this week
- cuDF - GPU DataFrame Library☆8,837Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,059Updated this week
- Voilà turns Jupyter notebooks into standalone web applications☆5,634Updated last month
- The Open Source Feature Store for AI/ML☆5,899Updated this week
- An open source python library for automated feature engineering☆7,405Updated 2 weeks ago
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.☆8,696Updated this week
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.☆19,022Updated 5 months ago
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,331Updated last month
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!☆7,561Updated this week
- Create delightful software with Jupyter Notebooks☆5,052Updated this week