kedro-org / kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
☆10,228 Updated this week
Alternatives and similar repositories for kedro:
Users who are interested in kedro are comparing it to the libraries listed below:
- Always know what to expect from your data. ☆10,273 Updated this week
- Build, Deploy and Manage AI/ML Systems ☆8,644 Updated this week
- 📚 Parameterize, execute, and analyze notebooks ☆6,113 Updated 2 months ago
- ♾️ CML - Continuous Machine Learning | CI/CD for ML ☆4,075 Updated this week
- Create delightful software with Jupyter Notebooks ☆5,040 Updated this week
- A light-weight, flexible, and expressive statistical data testing library ☆3,688 Updated 2 weeks ago
- Visualise your Kedro data and machine-learning pipelines and track your experiments. ☆706 Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,356 Updated 5 months ago
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,064 Updated this week
- An orchestration platform for the development, production, and observation of data assets. ☆12,772 Updated this week
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ ☆3,557 Updated 6 months ago
- An open source python library for automated feature engineering ☆7,384 Updated this week
- Declarative visualization library for Python ☆9,651 Updated 2 weeks ago
- The Open Source Feature Store for AI/ML ☆5,874 Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows ☆39,217 Updated this week
- Build data pipelines, the easy way 🛠️ ☆4,113 Updated last year
- the portable Python dataframe library ☆5,608 Updated this week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames. ☆12,799 Updated this week
- A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood. ☆1,958 Updated 2 weeks ago
- Visualizer for pandas data structures ☆4,872 Updated this week
- 🦉 Data Versioning and ML Experiments ☆14,282 Updated 2 weeks ago
- Voilà turns Jupyter notebooks into standalone web applications ☆5,620 Updated 2 weeks ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,055 Updated 6 months ago
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. ☆8,655 Updated this week
- Fit interpretable models. Explain blackbox machine learning. ☆6,426 Updated this week
- cuDF - GPU DataFrame Library ☆8,776 Updated this week
- Curated list of resources about Apache Airflow ☆3,750 Updated 7 months ago
- A flexible, intuitive and fast forecasting library ☆1,830 Updated last month
- A simple and efficient tool to parallelize Pandas operations on all available CPUs ☆3,730 Updated 8 months ago
- A Python library that helps data scientists to infer causation rather than observing correlation. ☆2,290 Updated 8 months ago