capitalone / rubicon-ml
Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!
☆132Updated last week
Alternatives and similar repositories for rubicon-ml:
Users who are interested in rubicon-ml are comparing it to the libraries listed below:
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆141Updated 3 months ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save.☆77Updated last year
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated 11 months ago
- Tools for making Prefect work better for typical data science workflows☆18Updated 3 years ago
- Dask integration for Snowflake☆30Updated 5 months ago
- A Kedro plugin that provides drop-in replacements for the pandas datasets (e.g., Modin and cuDF)☆12Updated 4 years ago
- The easiest way to integrate Kedro and Great Expectations☆53Updated 2 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 8 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 4 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- An abstraction layer for parameter tuning☆35Updated 8 months ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM☆84Updated last year
- RFC document, tooling and other content related to the dataframe API standard☆108Updated last year
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆500Updated 3 months ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more!☆68Updated 7 months ago
- Type System for Data Analysis in Python☆212Updated 3 months ago
- DataFrame support for scikit-learn.☆63Updated last year
- Tutorial for implementing data validation in data science pipelines☆33Updated 2 years ago
- fsspec-compatible Azure Data Lake and Azure Blob Storage access☆189Updated 4 months ago
- Kedro Plugin to support running workflows on GCP Vertex AI Pipelines☆36Updated this week
- Implementation of Cyclic Boosting machine learning algorithms☆88Updated 8 months ago
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables☆66Updated last year
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- Coming soon☆61Updated last year
- Woodwork is a Python library that provides robust methods for managing and communicating data typing information.☆153Updated last week
- Spark implementation of computing Shapley values using Monte Carlo approximation☆74Updated 2 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes.☆224Updated 4 years ago
- Pandas ExtensionDtype/ExtensionArray backed by Apache Arrow☆229Updated 2 years ago
- A unified wrapper for various ML frameworks - to have one uniform scikit-learn format for predict and predict_proba functions.☆48Updated 3 months ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago