kedro-org / kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
β10,158Updated this week
Alternatives and similar repositories for kedro:
Users that are interested in kedro are comparing it to the libraries listed below
- Open Source AI/ML Platformβ8,535Updated this week
- π¦ Data Versioning and ML Experimentsβ14,156Updated this week
- Always know what to expect from your data.β10,193Updated this week
- Build data pipelines, the easy way π οΈβ4,110Updated last year
- The Open Source Feature Store for AI/MLβ5,813Updated this week
- βΎοΈ CML - Continuous Machine Learning | CI/CD for MLβ4,065Updated this week
- The fastest β‘οΈ way to build data pipelines. Develop iteratively, deploy anywhere. βοΈβ3,542Updated 5 months ago
- Open source platform for the machine learning lifecycleβ19,486Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,335Updated 4 months ago
- A light-weight, flexible, and expressive statistical data testing libraryβ3,617Updated this week
- Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Froβ¦β5,702Updated this week
- ZenML π: The bridge between ML and Ops. https://zenml.io.β4,413Updated this week
- Visualise your Kedro data and machine-learning pipelines and track your experiments.β698Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,040Updated 4 months ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning modelsβ4,448Updated this week
- Parallel computing with task schedulingβ12,934Updated this week
- Panel: The powerful data exploration & web app framework for Pythonβ5,031Updated this week
- the portable Python dataframe libraryβ5,514Updated this week
- VoilΓ turns Jupyter notebooks into standalone web applicationsβ5,581Updated 2 weeks ago
- Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.β6,003Updated this week
- A Python library that helps data scientists to infer causation rather than observing correlation.β2,281Updated 7 months ago
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!β7,374Updated this week
- Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analyβ¦β5,633Updated last week
- ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling β¦β5,821Updated this week
- Algorithms for outlier, adversarial and drift detectionβ2,301Updated 3 weeks ago
- π Parameterize, execute, and analyze notebooksβ6,080Updated last month
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β3,705Updated 2 months ago
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadminβ6,271Updated 2 months ago
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.β8,560Updated this week
- Visualize and compare datasets, target values and associations, with one line of code.β2,981Updated 6 months ago