theodi / synthetic-data-tutorialLinks
A hands-on tutorial showing how to use Python to do anonymisation with synthetic data
☆79Updated 3 years ago
Alternatives and similar repositories for synthetic-data-tutorial
Users that are interested in synthetic-data-tutorial are comparing it to the libraries listed below
Sorting:
- ☆271Updated last year
- General Purpose Risk Modeling and Prediction Toolkit for Policy and Social Good Problems☆194Updated this week
- Slides, videos and other potentially useful artifacts from various presentations on responsible machine learning.☆22Updated 5 years ago
- ☆37Updated 3 months ago
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 4 years ago
- ☆29Updated 6 years ago
- A library that implements fairness-aware machine learning algorithms☆126Updated 4 years ago
- A command line tool to easily add an ethics checklist to your data science projects.☆299Updated last year
- Explore 120 million taxi trips in real time with Dash and Vaex☆117Updated 4 years ago
- Capturing model drift and handling its response - Example webinar☆108Updated 6 years ago
- Recipes for Driverless AI☆251Updated last month
- Guide on creating an API for serving your ML model☆67Updated 3 years ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- python library for automated dataset normalization☆116Updated 2 years ago
- Privacy transformations on Spark and Pandas dataframes backed by a simple policy language.☆175Updated 2 years ago
- Materials for "Docker for Data Science" tutorial presented at PyCon 2018 in Cleveland, OH☆156Updated 4 years ago
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo…☆171Updated 2 years ago
- Simplifies use of the Dedupe library via Pandas☆136Updated 2 years ago
- This is a repo for all the tutorials put out by H2O.ai. This includes learning paths for Driverless AI, H2O-3, Sparkling Water and more..…☆134Updated last year
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 6 years ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆284Updated 3 years ago
- Data Analysis Baseline Library☆133Updated 10 months ago
- Buy Till You Die and Customer Lifetime Value statistical models in Python.☆117Updated last year
- Trumania is a scenario-based random dataset generator library in python 3☆112Updated 3 years ago
- ☆96Updated 5 years ago
- How to use Python to understand data and transform the data into a tidy format ready to be used for modelling and visualisation.☆36Updated 6 years ago
- Code samples and documentation for SmartNoise differential privacy tools☆134Updated 3 years ago
- A short tutorial for data scientists on how to write tests for code + data.☆120Updated 5 years ago
- Record matching and entity resolution at scale in Spark☆35Updated last year
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 3 years ago