DataResponsibly / MirrorDataGeneratorLinks
MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. It focuses on how features relate with demographic attributes (e.g. gender, race, disability status, etc), which are considered as sensitive information for certain domains (e.g. employment, housing, etc).
☆24Updated 3 years ago
Alternatives and similar repositories for MirrorDataGenerator
Users that are interested in MirrorDataGenerator are comparing it to the libraries listed below
Sorting:
- Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.☆82Updated 3 months ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆25Updated last year
- ☆24Updated 2 years ago
- Baker is an AI powered app that helps you find recipes and avoid food waste☆14Updated 7 months ago
- Streamlit app for recommending eval functions using prompt diffs☆29Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Generating Realistic Synthetic Data☆41Updated last year
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆24Updated 5 months ago
- AI_Powered_Dev_Search_Engine☆12Updated last year
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real…☆28Updated 4 months ago
- Knowledge Graph Generator app☆33Updated last year
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Real-time embeddings for data on the move☆21Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆61Updated last week
- python jupyter notebook tutorials☆13Updated 7 months ago
- A tutorial for building autonomous agents: with LangChain and from scratch☆30Updated 2 years ago
- A library to use `modal` as a backend for `joblib`.☆29Updated 7 months ago
- Retrieval Augmented Generation applications☆26Updated last year
- Leverage your LangChain trace data for fine tuning☆44Updated last year
- Framework for building and maintaining self-updating prompts for LLMs☆64Updated last year
- examples and guides to using Nomic Atlas☆39Updated 4 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆50Updated 11 months ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline☆38Updated last year
- 📚 Build knowledge bases for RAG☆24Updated last month
- Python package that adds IntelligentGraph capabilities to RDFLib RDF graph package☆55Updated last year
- Solve Geometric & Graph Problems with Large Language Models☆31Updated 2 years ago
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated last year
- Python package for extractive NLP using the OpenAI API☆17Updated 11 months ago
- A curated collection of example marimo notebooks — use these as templates for your own experiments, workflows, and tools.☆50Updated last week
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated 2 years ago