AlexFrid / anonymizedf
a convenient way to anonymize your data for analytics
☆22Updated 3 years ago
Alternatives and similar repositories for anonymizedf
Users that are interested in anonymizedf are comparing it to the libraries listed below
Sorting:
- Build your feature store with macros right within your dbt repository☆38Updated 2 years ago
- The easiest way to integrate Kedro and Great Expectations☆53Updated 2 years ago
- Demo on how to use Prefect with Docker☆25Updated 2 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- An abstraction layer for parameter tuning☆35Updated 8 months ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- A scikit-learn compatible estimator based on business-rules with interactive dashboard included☆28Updated 3 years ago
- real-time data + ML pipeline☆54Updated last month
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- Confusion Matrix in Python: plot a pretty confusion matrix (like Matlab) in python using seaborn and matplotlib☆19Updated 3 years ago
- Render Jupyter Notebooks With Metaflow Cards☆29Updated 7 months ago
- A very simple "hello world" project for deploying Prefect 2 to a docker container on Google Compute Engine.☆11Updated 2 years ago
- Pre-processing database using pre-written functions☆20Updated 4 years ago
- A library to use `modal` as a backend for `joblib`.☆28Updated 3 months ago
- Example of configuring multiplage apps via a custom config file☆18Updated last year
- ☆39Updated 3 months ago
- Cost Efficient Data Pipelines with DuckDB☆52Updated 9 months ago
- Demo on how to use Prefect 2 in an ML project☆41Updated 2 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- Model drift detection☆11Updated last year
- This is a demo of a dataframe with editable cells, powered by `streamlit-aggrid` from Pablo Fonseca. You can edit the cells by clicking o…☆44Updated last year
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆23Updated 2 years ago
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆65Updated 3 months ago
- ☆29Updated last year
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- ☆21Updated 8 months ago
- Create a local dashboard to visualize and filter your GitHub feed☆29Updated 2 years ago
- Generate beautiful, testable documentation with Jupyter Notebooks☆21Updated 2 years ago
- A streamlit component to embed Disqus in your applications.☆10Updated 3 years ago
- Playground for using large language models into the Modern Data Stack for entity matching☆107Updated 2 years ago