capeprivacy / cape-dataframes
Privacy transformations on Spark and Pandas dataframes backed by a simple policy language.
☆173Updated last year
Alternatives and similar repositories for cape-dataframes:
Users that are interested in cape-dataframes are comparing it to the libraries listed below
- PipelineDP is a Python framework for applying differentially private aggregations to large datasets using batch processing systems such a…☆274Updated 2 months ago
- Python language bindings for smartnoise-core.☆76Updated 2 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- MLOps Cookiecutter Template: A Base Project Structure for Secure Production ML Engineering☆40Updated 4 months ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated 10 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated 11 months ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆180Updated 8 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 6 months ago
- A command line tool to easily add an ethics checklist to your data science projects.☆292Updated 8 months ago
- MLOps simplified. One platform, all the functionality you need. Swiss made☆98Updated this week
- Differential privacy validator and runtime☆291Updated 3 years ago
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo…☆169Updated last year
- A library of Reversible Data Transforms☆124Updated this week
- ☆26Updated 4 years ago
- Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshop…☆142Updated 7 months ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆105Updated last year
- openclean - Data Cleaning and data profiling library for Python☆74Updated 3 years ago
- ∞ Priceloop Engineering Conventions for Scala, Python, Git Workflow etc☆101Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan…☆53Updated last year
- 🎲 A curated list of MLOps projects, tools and resources☆186Updated 11 months ago
- Type System for Data Analysis in Python☆211Updated last month
- T4 is now in production as Quilt 3☆64Updated 5 years ago
- A hands-on tutorial showing how to use Python to do anonymisation with synthetic data☆78Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations☆53Updated 2 years ago
- python automatic data quality check toolkit☆283Updated 4 years ago
- A privacy-preserving app for comparing last-known locations of coronavirus patients☆44Updated last year
- Capturing model drift and handling its response - Example webinar☆107Updated 5 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- A workshop on data privacy methods for data scientists.☆69Updated 2 years ago