DataResponsibly / MirrorDataGenerator
MirrorDataGenerator is a Python tool that generates synthetic data based on user-specified causal relations among features in the data. It focuses on how features relate to demographic attributes (e.g. gender, race, disability status), which are considered sensitive information in certain domains (e.g. employment, housing).
☆21 Updated 2 years ago
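The idea in the description, generating features from user-specified causal relations that start at a demographic attribute, can be illustrated with a minimal sketch. This is not MirrorDataGenerator's API: the function name `generate_causal_sample`, the three-node chain (gender → score → hired), and all distribution parameters are hypothetical choices made for illustration, using only NumPy.

```python
import numpy as np


def generate_causal_sample(n_rows=1000, seed=0):
    """Hypothetical sketch: sample a tiny causal chain gender -> score -> hired.

    Not part of MirrorDataGenerator; names and parameters are assumptions.
    """
    rng = np.random.default_rng(seed)

    # Root node: a sensitive demographic attribute with a fixed distribution.
    gender = rng.choice(["F", "M"], size=n_rows, p=[0.5, 0.5])

    # Child node: a numeric feature whose mean depends on the parent value
    # (this encodes the assumed causal relation gender -> score).
    group_mean = np.where(gender == "F", 60.0, 70.0)
    score = rng.normal(loc=group_mean, scale=5.0)

    # Grandchild node: a binary outcome that depends on the noisy score
    # (the assumed relation score -> hired).
    noise = rng.normal(loc=0.0, scale=2.0, size=n_rows)
    hired = (score + noise > 65.0).astype(int)

    return {"gender": gender, "score": score, "hired": hired}


if __name__ == "__main__":
    data = generate_causal_sample(n_rows=2000, seed=1)
    for group in ("F", "M"):
        mask = data["gender"] == group
        print(group, "mean score:", data["score"][mask].mean(),
              "hire rate:", data["hired"][mask].mean())
```

Because the outcome depends on gender only through the score, comparing hire rates per group shows how a dependency on a sensitive attribute propagates through a dataset, which is the kind of relation the tool's description refers to.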
Alternatives and similar repositories for MirrorDataGenerator:
Users who are interested in MirrorDataGenerator are comparing it to the libraries listed below.
- This repository implements DSPy programs for tasks in Indian languages ☆13 Updated last year
- Streamlit app for recommending eval functions using prompt diffs ☆27 Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 Updated last year
- Examples and guides for using Nomic Atlas ☆27 Updated this week
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph ☆23 Updated last year
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg… ☆20 Updated 3 weeks ago
- GraphRag vs Embeddings ☆13 Updated 8 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- Entity resolution, also known as data matching or record linkage, is the task of finding records in a data set that refer to the same or similar real… ☆23 Updated 5 months ago
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injection… ☆10 Updated 3 weeks ago
- ☆12 Updated 7 months ago
- ☆12 Updated 2 years ago
- Awesome Orchest projects, both official and submitted by the community. ☆25 Updated last year
- AI_Powered_Dev_Search_Engine ☆12 Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆39 Updated last year
- Leverage your LangChain trace data for fine-tuning ☆41 Updated 8 months ago
- Solve Geometric & Graph Problems with Large Language Models ☆28 Updated 2 years ago
- Baker is an AI-powered app that helps you find recipes and avoid food waste ☆14 Updated 2 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 Updated last year
- ☆10 Updated 6 months ago
- Unstract's interface to LLMs, Embeddings and VectorDBs. ☆18 Updated 8 months ago
- Code for paper: "Privately generating tabular data using language models". ☆15 Updated last year
- ☆18 Updated last year
- Retrieval Augmented Generation applications ☆26 Updated last year
- LMQL implementation of tree of thoughts ☆34 Updated last year
- Python package for extractive NLP using the OpenAI API ☆17 Updated 7 months ago
- Portable Python ML-powered data bot ☆23 Updated 6 months ago
- ☆29 Updated last year
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated) ☆23 Updated last year
- ☆20 Updated last year