cvs-health / uqlm
UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection
☆61Updated this week
Alternatives and similar repositories for uqlm
Users that are interested in uqlm are comparing it to the libraries listed below
Sorting:
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆207Updated 3 weeks ago
- Client interface to Cleanlab Studio and the Trustworthy Language Model☆31Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆31Updated 3 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- ☆57Updated this week
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 7 months ago
- ☆129Updated last month
- Functional Benchmarks and the Reasoning Gap☆86Updated 7 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆29Updated 3 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- ☆38Updated 10 months ago
- Sphynx Hallucination Induction☆54Updated 3 months ago
- Efficiently computing & storing token n-grams from large corpora☆23Updated 7 months ago
- SynthGenAI - Package for Generating Synthetic Datasets using LLMs.☆36Updated 3 months ago
- ☆48Updated 6 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆108Updated 2 months ago
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆29Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM☆72Updated last week
- Introduction to Data-Centric AI, MIT IAP 2023 🤖☆100Updated 3 months ago
- code for training & evaluating Contextual Document Embedding models☆189Updated this week
- Fine-tune an LLM to perform batch inference and online serving.☆110Updated last week
- ☆18Updated 3 months ago
- Train transformer language models with reinforcement learning.☆18Updated 2 months ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆17Updated 10 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆74Updated last month
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated last month
- Open source interpretability artefacts for R1.☆131Updated 3 weeks ago
- A reasoning assistant for your STEM education☆19Updated 2 months ago
- ☆50Updated 5 months ago
- ☆31Updated 2 months ago