kedro-org / kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
β10,243Updated this week
Alternatives and similar repositories for kedro:
Users that are interested in kedro are comparing it to the libraries listed below
- π Parameterize, execute, and analyze notebooksβ6,115Updated 2 months ago
- Build, Manage and Deploy AI/ML Systemsβ8,677Updated this week
- Visualise your Kedro data and machine-learning pipelines and track your experiments.β707Updated this week
- Always know what to expect from your data.β10,297Updated this week
- VoilΓ turns Jupyter notebooks into standalone web applicationsβ5,626Updated 3 weeks ago
- A light-weight, flexible, and expressive statistical data testing libraryβ3,711Updated this week
- Create delightful software with Jupyter Notebooksβ5,047Updated this week
- π¦ Data Versioning and ML Experimentsβ14,324Updated last week
- the portable Python dataframe libraryβ5,648Updated this week
- Build data pipelines, the easy way π οΈβ4,116Updated last year
- Modin: Scale your Pandas workflows by changing a single line of codeβ10,095Updated this week
- The fastest β‘οΈ way to build data pipelines. Develop iteratively, deploy anywhere. βοΈβ3,559Updated 6 months ago
- Open source platform for the machine learning lifecycleβ19,972Updated this week
- An orchestration platform for the development, production, and observation of data assets.β12,826Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,360Updated 5 months ago
- An open source python library for automated feature engineeringβ7,405Updated 2 weeks ago
- Streamlit β A faster way to build and share data apps.β38,446Updated this week
- Parallel computing with task schedulingβ13,083Updated last week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β18,797Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,059Updated this week
- Automatically visualize your pandas dataframe via a single print! π π‘β5,264Updated last year
- ZenML π: The bridge between ML and Ops. https://zenml.io.β4,491Updated this week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.β12,816Updated last week
- Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentationβ3,131Updated last week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycleβ3,620Updated last month
- Convert Jupyter Notebooks to Web Appsβ4,159Updated 3 months ago
- βΎοΈ CML - Continuous Machine Learning | CI/CD for MLβ4,086Updated this week
- A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.β1,959Updated 3 weeks ago
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scriptsβ6,795Updated last week
- Visualize and compare datasets, target values and associations, with one line of code.β3,001Updated 7 months ago