mostly-ai / mostlyai-qaLinks
Synthetic Data Quality Assurance π
β60Updated last month
Alternatives and similar repositories for mostlyai-qa
Users that are interested in mostlyai-qa are comparing it to the libraries listed below
Sorting:
- Synthetic Data Engine πβ64Updated last week
- Synthetic Data SDK β¨β622Updated this week
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findingsβ34Updated 9 months ago
- β19Updated 3 months ago
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"β¦β23Updated 8 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessmentsβ230Updated this week
- A framework for fine-tuning retrieval-augmented generation (RAG) systems.β127Updated this week
- Generate Python Package with Simple Promptsβ71Updated 9 months ago
- The missing middleware layer in healthcare AI π« π₯β126Updated last week
- An open-source compliance-centered evaluation framework for Generative AI modelsβ161Updated this week
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"β54Updated 3 months ago
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatioβ¦β42Updated 2 weeks ago
- Interactive Variational Autoencoder (VAE)β58Updated 10 months ago
- π Automatically annotate papers using LLMsβ343Updated 4 months ago
- A framework for pitting LLMs against each other in an evolving library of games ββ33Updated 4 months ago
- A curated list of awesome synthetic data tools (open source and commercial).β201Updated last year
- Wonderful Matrices to Build Small Language Modelsβ44Updated 6 months ago
- A small library of LLM judgesβ271Updated last month
- OLAPH: Improving Factuality in Biomedical Long-form Question Answeringβ39Updated 11 months ago
- β230Updated last month
- A method for steering llms to better follow instructionsβ49Updated 3 weeks ago
- Open-source Python toolkit focused on deep learning with ordinal methodologiesβ56Updated last month
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"β25Updated last year
- Lightweight Nearest Neighbors with Flexible Backendsβ297Updated last month
- A Lightweight Library for AI Observabilityβ250Updated 6 months ago
- Friends of OLMo and their links.β287Updated 8 months ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.β168Updated 2 weeks ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)β120Updated 6 months ago
- β131Updated 3 months ago
- AI Verifyβ28Updated this week