capitalone / rubicon-ml
Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!
☆132 Updated 4 months ago
Alternatives and similar repositories for rubicon-ml:
Users who are interested in rubicon-ml are comparing it to the libraries listed below.
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆76 Updated last year
- IbisML is a library for building scalable ML pipelines using Ibis. ☆108 Updated 3 months ago
- A Kedro plugin that provides pandas drop-in replacements for the pandas datasets (e.g., Modin and cuDF) ☆12 Updated 4 years ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure, and more... ☆139 Updated 2 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 Updated 6 months ago
- Type System for Data Analysis in Python ☆211 Updated last month
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated 11 months ago
- Spark implementation of computing Shapley Values using Monte Carlo approximation ☆74 Updated 2 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows. ☆80 Updated 10 months ago
- Tutorial for implementing data validation in data science pipelines ☆33 Updated 2 years ago
- First-party plugins maintained by the Kedro team. ☆98 Updated last week
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same… ☆28 Updated 2 years ago
- Pandas ExtensionDType/Array backed by Apache Arrow ☆229 Updated 2 years ago
- An abstraction layer for parameter tuning ☆35 Updated 6 months ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects ☆105 Updated last year
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆83 Updated last year
- RFC document, tooling and other content related to the dataframe API standard ☆106 Updated 11 months ago
- Assessing whether data from a database complies with reference information. ☆42 Updated 2 weeks ago
- Dask integration for Snowflake ☆30 Updated 4 months ago
- The easiest way to integrate Kedro and Great Expectations ☆53 Updated 2 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆224 Updated 4 years ago
- ☆21 Updated 7 months ago
- Coming soon ☆60 Updated last year
- Tools for making Prefect work better for typical data science workflows ☆19 Updated 3 years ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎ ☆499 Updated 2 months ago
- Demo repository to lambda-fy your dbt runs ☆11 Updated last year
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan… ☆53 Updated 2 years ago
- Projects developed by Domino's R&D team ☆76 Updated 2 years ago
- ☆55 Updated last year
- Supporting materials/code examples for my course in data engineering for machine learning. ☆38 Updated 2 years ago