DataResponsibly / MirrorDataGenerator
MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. It focuses on how features relate with demographic attributes (e.g. gender, race, disability status, etc), which are considered as sensitive information for certain domains (e.g. employment, housing, etc).
☆23Updated 2 years ago
Alternatives and similar repositories for MirrorDataGenerator:
Users that are interested in MirrorDataGenerator are comparing it to the libraries listed below
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- This repository implements DSPy programs to tasks in Indian Languages☆13Updated last year
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆24Updated last year
- AI_Powered_Dev_Search_Engine☆12Updated last year
- LLM application tracing based on OpenTelemetry☆10Updated 2 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆57Updated last week
- Code for paper: "Privately generating tabular data using language models".☆15Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆15Updated last week
- Fast implementations of common forecasting routines☆37Updated this week
- ☆22Updated last year
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated 9 months ago
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real…☆23Updated 2 weeks ago
- examples and guides to using Nomic Atlas☆32Updated last week
- A tutorial on DSPy and whether automated prompt engineering lives up to the hype☆22Updated 11 months ago
- ☆17Updated 2 years ago
- Retrieval Augmented Generation applications☆26Updated last year
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆22Updated last month
- This repository contains the implementation of evaluation metrics for recommendation systems. We have compared similarity, candidate gene…☆18Updated 2 months ago
- A library to use `modal` as a backend for `joblib`.☆28Updated 3 months ago
- ☆43Updated 5 months ago
- GraphRag vs Embeddings☆13Updated 9 months ago
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injection…☆11Updated last month
- Code that accompanies the PyData New York (2022) talk: Addressing the sensitivity of Large language models☆13Updated 2 years ago
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆16Updated 11 months ago
- Baker is an AI powered app that helps you find recipes and avoid food waste☆14Updated 3 months ago
- Finetune Llama 2 on Colab for free on your own data: step-by-step tutorial☆32Updated 11 months ago