mostly-ai / mostlyai-qaLinks
Synthetic Data Quality Assurance π
β65Updated last week
Alternatives and similar repositories for mostlyai-qa
Users that are interested in mostlyai-qa are comparing it to the libraries listed below
Sorting:
- Synthetic Data Engine πβ72Updated 2 weeks ago
- Synthetic Data SDK β¨β700Updated last week
- A curated list of awesome synthetic data tools (open source and commercial).β231Updated 2 years ago
- Generate Python Package with Simple Promptsβ75Updated last year
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"β¦β25Updated 3 months ago
- An open-source compliance-centered evaluation framework for Generative AI modelsβ178Updated 3 weeks ago
- A Lightweight Library for AI Observabilityβ255Updated 11 months ago
- EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other laβ¦β92Updated last month
- β96Updated 3 months ago
- β19Updated 8 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.β141Updated 4 months ago
- A small library of LLM judgesβ314Updated 5 months ago
- Declarative context engineering for agentsβ433Updated last week
- DSPydantic: Auto-Optimize Your Pydantic Models with DSPyβ238Updated last month
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findingsβ34Updated last year
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessmentsβ251Updated last week
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.β138Updated last week
- A framework for evaluating RAG pipelines, specifically adapted for the legal domain.β73Updated 5 months ago
- A curated list of materials on AI guardrailsβ44Updated 7 months ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.β173Updated 2 weeks ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)β128Updated 11 months ago
- Synthetic Text Dataset Generation for LLM projectsβ55Updated last month
- Train LLM on Hugging Face infraβ67Updated 2 months ago
- β237Updated last month
- β147Updated last year
- awesome synthetic (text) datasetsβ321Updated last week
- Open-source Python toolkit focused on deep learning with ordinal methodologiesβ65Updated last month
- Benchmark and optimize LLM inference across frameworks with easeβ158Updated 4 months ago
- Friends of OLMo and their links.β356Updated 4 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluationβ108Updated last year