mostly-ai / mostlyai-qaLinks
Synthetic Data Quality Assurance π
β65Updated last month
Alternatives and similar repositories for mostlyai-qa
Users that are interested in mostlyai-qa are comparing it to the libraries listed below
Sorting:
- Synthetic Data Engine πβ69Updated last week
- Synthetic Data SDK β¨β690Updated last week
- An open-source compliance-centered evaluation framework for Generative AI modelsβ174Updated this week
- A curated list of awesome synthetic data tools (open source and commercial).β228Updated last year
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessmentsβ242Updated last week
- β92Updated last month
- A Lightweight Library for AI Observabilityβ252Updated 9 months ago
- Benchmark and optimize LLM inference across frameworks with easeβ148Updated 3 months ago
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"β¦β24Updated last month
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.β137Updated this week
- A general library for generating high-quality synthetic data from scratch or based on your own seed data.β403Updated this week
- A curated list of materials on AI guardrailsβ43Updated 6 months ago
- Synthetic Text Dataset Generation for LLM projectsβ52Updated 2 weeks ago
- DSPydantic: Auto-Optimize Your Pydantic Models with DSPyβ67Updated last week
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findingsβ34Updated last year
- Train LLM on Hugging Face infraβ67Updated last month
- A small library of LLM judgesβ306Updated 4 months ago
- A framework for evaluating RAG pipelines, specifically adapted for the legal domain.β71Updated 4 months ago
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other laβ¦β92Updated 2 weeks ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various useβ¦β160Updated last week
- Build reliable AI and agentic applications with DataFramesβ415Updated this week
- An alignment auditing agent capable of quickly exploring alignment hypothesisβ708Updated 2 weeks ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.β173Updated last week
- Lightweight Nearest Neighbors with Flexible Backendsβ322Updated 2 months ago
- β37Updated last year
- Unified Schema-Based Information Extractionβ355Updated this week
- Curate High Quality Datasets, Train, Evaluate and Ship! πβ675Updated this week
- SUQL: Conversational Search over Structured and Unstructured Data with LLMsβ293Updated last month
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatioβ¦β44Updated last week
- Build datasets using natural languageβ551Updated 2 months ago