DataResponsibly / MirrorDataGenerator
MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. It focuses on how features relate with demographic attributes (e.g. gender, race, disability status, etc), which are considered as sensitive information for certain domains (e.g. employment, housing, etc).
☆21Updated 2 years ago
Alternatives and similar repositories for MirrorDataGenerator:
Users that are interested in MirrorDataGenerator are comparing it to the libraries listed below
- This repository implements DSPy programs to tasks in Indian Languages☆11Updated last year
- Streamlit app for recommending eval functions using prompt diffs☆26Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar real…☆24Updated 3 months ago
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated 5 months ago
- AI_Powered_Dev_Search_Engine☆12Updated 10 months ago
- ChatBot App built using LangChain and Lightning AI☆18Updated last year
- A logical, reasonably standardized, but flexible project structure for conducting ml research 🍪☆15Updated last month
- Writing Blog Posts with Generative Feedback Loops!☆46Updated 9 months ago
- Python package for extractive NLP using the OpenAI API☆16Updated 4 months ago
- examples and guides to using Nomic Atlas☆27Updated 4 months ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆23Updated 10 months ago
- Fun stuff to do with runhouse☆24Updated last year
- Compound AI toolchain for fast and accurate entity matching, powered by LLMs.☆20Updated 3 weeks ago
- ☆12Updated last year
- Example Code to Supplement the Label Studio Blog☆20Updated this week
- Run LLMs on Replicate with vLLM☆15Updated 3 months ago
- ☆18Updated 3 months ago
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)☆20Updated last year
- Retrieval Augmented Generation applications☆26Updated last year
- ☆17Updated last year
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆22Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆37Updated 10 months ago
- Baker is an AI powered app that helps you find recipes and avoid food waste☆14Updated 2 weeks ago
- ☆12Updated this week
- A personal knowledge base that I can dump information to and help me learn☆23Updated 7 months ago
- ☆11Updated last year
- Automatic Test Generator☆11Updated last year