DataResponsibly / MirrorDataGenerator
MirrorDataGenerator is a Python tool that generates synthetic data from user-specified causal relations among features. It focuses on how features relate to demographic attributes (e.g. gender, race, disability status), which are considered sensitive information in certain domains (e.g. employment, housing).
☆25 Updated 3 years ago
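The idea of generating data from user-specified causal relations can be illustrated with a minimal sketch. This is not MirrorDataGenerator's API: the function name `generate_rows`, the three-node graph (gender -> score -> hired), and all means and thresholds are hypothetical and chosen only for illustration.

```python
import random


def generate_rows(n, seed=0):
    """Draw n synthetic rows from a tiny hand-written causal graph.

    Hypothetical graph (an assumption, not taken from the tool):
        gender -> score -> hired
    """
    rng = random.Random(seed)  # seeded so the output is reproducible
    rows = []
    for _ in range(n):
        # Root node: the sensitive attribute is sampled first.
        gender = rng.choice(["F", "M"])
        # Edge gender -> score: the mean of score depends on gender.
        mean = 60.0 if gender == "F" else 65.0
        score = rng.gauss(mean, 10.0)
        # Edge score -> hired: the label is a fixed threshold on score.
        hired = score >= 62.5
        rows.append({"gender": gender, "score": score, "hired": hired})
    return rows


rows = generate_rows(1000)
```

Because `hired` depends on `gender` only through `score`, any disparity in hiring rates between groups in `rows` is induced by the specified edge, which is the kind of controlled relationship such a generator lets a user encode.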
Alternatives and similar repositories for MirrorDataGenerator
Users who are interested in MirrorDataGenerator are comparing it to the libraries listed below.
- Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text. ☆85 Updated 2 weeks ago
- Encountering 14 different Naive RAG fails and using KG to solve it ☆19 Updated last month
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph ☆25 Updated last year
- Baker is an AI powered app that helps you find recipes and avoid food waste ☆14 Updated last year
- ☆24 Updated 2 years ago
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆17 Updated 2 years ago
- Streamlit app for recommending eval functions using prompt diffs ☆30 Updated 2 years ago
- Retrieval Augmented Generation applications ☆26 Updated 2 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg… ☆26 Updated 10 months ago
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real… ☆31 Updated 9 months ago
- Python package for extractive NLP using the OpenAI API ☆17 Updated last year
- Knowledge Graph Generator app ☆34 Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆48 Updated last year
- Training LLMs to reason and analyze data with notebooks ☆61 Updated 4 months ago
- Sample solution to automate tedious regulatory compliance processes using multi-agent systems ☆22 Updated 9 months ago
- AI_Powered_Dev_Search_Engine ☆12 Updated last year
- Framework for building and maintaining self-updating prompts for LLMs ☆65 Updated last year
- A tutorial on DSPy and whether automated prompt engineering lives up to the hype ☆25 Updated last year
- ☆20 Updated last month
- An introduction to DSPy ☆32 Updated 4 months ago
- Leverage your LangChain trace data for fine tuning ☆46 Updated last year
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ☆41 Updated last year
- Awesome Orchest projects, both official and submitted by the community. ☆26 Updated 2 years ago
- ☆23 Updated 2 months ago
- An open source code of the GitHub Copilot Workspace ☆12 Updated last year
- ☆20 Updated 2 years ago
- Unified slicing for all Python data structures. ☆37 Updated 5 months ago
- ☆21 Updated last year
- Blazing fast fuzzy text search for Python. ☆51 Updated 9 months ago