capitalone / rubicon-mlLinks
Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!
☆132Updated last week
Alternatives and similar repositories for rubicon-ml
Users that are interested in rubicon-ml are comparing it to the libraries listed below
Sorting:
- A Kedro plugin that provides pandas dropin replacements for the pandas datasets (e.g modin and cuDF)☆12Updated 4 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis.☆109Updated 5 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.☆153Updated last month
- An abstraction layer for parameter tuning☆35Updated 9 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆54Updated 9 months ago
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆225Updated 4 years ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆35Updated 3 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆77Updated last year
- First-party plugins maintained by the Kedro team.☆103Updated this week
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated last year
- Dask integration for Snowflake☆30Updated 6 months ago
- DataFrame support for scikit-learn.☆63Updated last year
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆106Updated 2 years ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆76Updated 5 months ago
- 💫 PyScaffold extension for data-science projects☆159Updated 2 months ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud Azure and more...☆141Updated this week
- fsspec-compatible Azure Datake and Azure Blob Storage access☆191Updated 5 months ago
- Tutorial for implementing data validation in data science pipelines☆33Updated 2 years ago
- Summarise and explore Pandas DataFrames☆98Updated 4 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated 2 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆29Updated 2 years ago
- Primrose modeling framework for simple production models☆32Updated last year
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated 2 years ago
- Coming soon☆61Updated last year
- The easiest way to integrate Kedro and Great Expectations☆52Updated 2 years ago
- implementation of Cyclic Boosting machine learning algorithms☆89Updated 9 months ago
- SciKIt-learn Pipeline in PAndas☆42Updated last year