DataResponsibly / MirrorDataGeneratorLinks
MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. It focuses on how features relate with demographic attributes (e.g. gender, race, disability status, etc), which are considered as sensitive information for certain domains (e.g. employment, housing, etc).
☆23Updated 3 years ago
Alternatives and similar repositories for MirrorDataGenerator
Users that are interested in MirrorDataGenerator are comparing it to the libraries listed below
Sorting:
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆59Updated this week
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.☆82Updated 2 months ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆24Updated last year
- Baker is an AI powered app that helps you find recipes and avoid food waste☆14Updated 6 months ago
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real…☆25Updated 3 months ago
- Leverage your LangChain trace data for fine tuning☆42Updated 11 months ago
- Framework for building and maintaining self-updating prompts for LLMs☆64Updated last year
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- ☆24Updated 2 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆23Updated 4 months ago
- A library to use `modal` as a backend for `joblib`.☆29Updated 6 months ago
- A project comparing the implementations of a basic AI agent using Langchain and PydanticAI frameworks☆14Updated 5 months ago
- The Gretel Python Client allows you to interact with the Gretel REST API.☆56Updated this week
- Writing Blog Posts with Generative Feedback Loops!☆49Updated last year
- ☆24Updated 2 years ago
- ☆21Updated 7 months ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline☆38Updated last year
- a convenient way to anonymize your data for analytics☆22Updated 3 years ago
- A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more!☆11Updated 9 months ago
- Generating Realistic Synthetic Data☆39Updated last year
- Compound AI toolchain for fast and accurate entity matching, powered by LLMs.☆23Updated 4 months ago
- Example Code to Supplement the Label Studio Blog☆26Updated 2 weeks ago
- Streamlit app for recommending eval functions using prompt diffs☆29Updated last year
- Have UV deal with all your Jupyter deps.☆27Updated 10 months ago
- Demo on how to use Prefect with Docker☆26Updated 2 years ago
- ☆22Updated last year
- Retrieval Augmented Generation applications☆26Updated last year
- Transforming textual descriptions into process models using deep learning☆14Updated 6 years ago
- A variation on a standard Decision Tree such as that in sklearn, where nodes may be based on an aggregation of multiple splits.☆9Updated last year
- LLM plugin for clustering embeddings☆77Updated last year