mostly-ai / mostlyai-qa
Synthetic Data Quality Assurance
☆64 · Updated last week
Alternatives and similar repositories for mostlyai-qa
Users who are interested in mostlyai-qa are comparing it to the libraries listed below.
- Synthetic Data Engine ☆66 · Updated this week
- Synthetic Data SDK ✨ ☆677 · Updated this week
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"… ☆23 · Updated 2 weeks ago
- Benchmark and optimize LLM inference across frameworks with ease ☆125 · Updated last month
- ☆19 · Updated 5 months ago
- A Lightweight Library for AI Observability ☆251 · Updated 8 months ago
- The missing middleware layer in healthcare AI ☆164 · Updated this week
- An open-source compliance-centered evaluation framework for Generative AI models ☆169 · Updated this week
- Generate Python Package with Simple Prompts ☆73 · Updated 11 months ago
- Problem-Oriented Segmentation and Retrieval (EMNLP 2024 Findings) ☆34 · Updated 11 months ago
- Train LLM on Hugging Face infra ☆65 · Updated last month
- ☆232 · Updated 3 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments ☆239 · Updated this week
- A curated list of awesome synthetic data tools (open source and commercial). ☆217 · Updated last year
- A small library of LLM judges ☆296 · Updated 3 months ago
- Named Entity Recognition using Claude Citations ☆79 · Updated 4 months ago
- A framework for pitting LLMs against each other in an evolving library of games ☆33 · Updated 6 months ago
- syftr is an agent optimizer that helps you find the best agentic workflows for your budget. ☆315 · Updated last week
- A framework for evaluating RAG pipelines, specifically adapted for the legal domain. ☆68 · Updated 3 months ago
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injection… ☆27 · Updated 7 months ago
- Synthetic Text Dataset Generation for LLM projects ☆43 · Updated last week
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst… ☆23 · Updated 7 months ago
- ☆146 · Updated last year
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la… ☆90 · Updated last week
- ☆214 · Updated this week
- SynthGenAI - Package for Generating Synthetic Datasets using LLMs. ☆50 · Updated this week
- A framework for fine-tuning retrieval-augmented generation (RAG) systems. ☆132 · Updated this week
- Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs ☆22 · Updated 3 months ago
- Official Implementation of "Affordable AI Assistants with Knowledge Graph of Thoughts" ☆144 · Updated last month
- Build reliable AI and agentic applications with DataFrames ☆373 · Updated this week