kedro-org / kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
β10,154Updated this week
Alternatives and similar repositories for kedro:
Users that are interested in kedro are comparing it to the libraries listed below
- Build data pipelines, the easy way π οΈβ4,106Updated last year
- Open Source AI/ML Platformβ8,524Updated this week
- The fastest β‘οΈ way to build data pipelines. Develop iteratively, deploy anywhere. βοΈβ3,541Updated 4 months ago
- π¦ Data Versioning and ML Experimentsβ14,146Updated this week
- βΎοΈ CML - Continuous Machine Learning | CI/CD for MLβ4,063Updated this week
- A light-weight, flexible, and expressive statistical data testing libraryβ3,610Updated this week
- Always know what to expect from your data.β10,189Updated this week
- Create delightful software with Jupyter Notebooksβ5,012Updated last week
- The Open Source Feature Store for Machine Learningβ5,804Updated this week
- ZenML π: The bridge between ML and Ops. https://zenml.io.β4,391Updated this week
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.β8,553Updated this week
- An orchestration platform for the development, production, and observation of data assets.β12,501Updated this week
- VoilΓ turns Jupyter notebooks into standalone web applicationsβ5,572Updated last week
- π Parameterize, execute, and analyze notebooksβ6,075Updated last month
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β18,239Updated this week
- Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Froβ¦β5,683Updated this week
- Visualise your Kedro data and machine-learning pipelines and track your experiments.β696Updated this week
- π Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Modelsβ2,772Updated last month
- Automatically visualize your pandas dataframe via a single print! π π‘β5,244Updated 10 months ago
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applicationβ¦β10,352Updated this week
- Streamlit β A faster way to build and share data apps.β37,299Updated this week
- Open source platform for the machine learning lifecycleβ19,460Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visβ¦β18,078Updated last week
- Parallel computing with task schedulingβ12,923Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,041Updated 4 months ago
- Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.β2,352Updated last month
- π§ Build, run, and manage data pipelines for integrating and transforming data.β8,134Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,334Updated 4 months ago
- Clean APIs for data cleaning. Python implementation of R package Janitorβ1,394Updated this week
- Panel: The powerful data exploration & web app framework for Pythonβ5,020Updated this week