microsoft / CoNLI_hallucinationLinks

CoNLI: a plug-and-play framework for ungrounded hallucination detection and reduction

☆31

Alternatives and similar repositories for CoNLI_hallucination

Users that are interested in CoNLI_hallucination are comparing it to the libraries listed below

Sorting:

DAMO-NLP-SG / contrastive-cot
Contrastive Chain-of-Thought Prompting
☆68Updated last year
abhika-m / FAVA
☆74Updated last year
Re-Align / just-eval
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
☆87Updated last year
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆82Updated last year
oriyor / reasoning-on-cots
Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"
☆96Updated last year
GasolSun36 / Iter-CoT
[NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
☆86Updated last year
OSU-NLP-Group / AttrScore
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
☆56Updated 2 years ago
weizhepei / InstructRAG
[ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
☆127Updated 8 months ago
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆148Updated 11 months ago
csitfun / LogiCoT
the instructions and demonstrations for building a formal logical reasoning capable GLM
☆54Updated last year
shizhediao / R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…
☆121Updated last year
mukhal / PromptRank
[ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting
☆27Updated 2 years ago
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆51Updated 4 months ago
yueyu1030 / AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
☆153Updated last year
McGill-NLP / retriever-lm-reasoning
Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…
☆28Updated last year
WENGSYX / Self-Verification
We have released the code and demo program required for LLM with self-verification
☆63Updated 2 years ago
Leezekun / Directional-Stimulus-Prompting
[NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"
☆113Updated 2 years ago
HowieHwong / MetaTool
[ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
☆99Updated last year
awslabs / rag-qa-arena
☆48Updated last year
gankim / tree-of-clarifications
🌲 Code for our EMNLP 2023 paper - 🎄 "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode…
☆52Updated last year
QingruZhang / PASTA
PASTA: Post-hoc Attention Steering for LLMs
☆123Updated 10 months ago
SALT-NLP / demonstrated-feedback
☆128Updated last year
LuJunru / MemoChat
MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation
☆28Updated last year
McGill-NLP / instruct-qa
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"
☆86Updated last year
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Updated last year
Anni-Zou / DocBench
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
☆47Updated last year
oriyor / ret-robust
Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"
☆73Updated last year
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆51Updated 2 months ago
qhjqhj00 / WebBrain
☆68Updated 2 years ago
voidism / EAR
Code for the ACL 2023 long paper - Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
☆37Updated 2 years ago