Liyan06 / MiniCheckLinks

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]

☆174

Alternatives and similar repositories for MiniCheck

Users that are interested in MiniCheck are comparing it to the libraries listed below

Sorting:

wang-research-lab / agentinstruct
Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"
☆115Updated 10 months ago
msclar / formatspread
Code accompanying "How I learned to start worrying about prompt formatting".
☆107Updated last month
ParticleMedia / RAGTruth
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
☆192Updated 8 months ago
zai-org / ComplexFuncBench
Complex Function Calling Benchmark.
☆123Updated 6 months ago
MadryLab / context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
☆268Updated 9 months ago
yueyu1030 / AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
☆152Updated last year
apple / ToolSandbox
☆194Updated 11 months ago
jakespringer / echo-embeddings
☆152Updated last year
zjunlp / OneGen
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
☆148Updated 8 months ago
facebookresearch / ReasonIR
Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".
☆188Updated last month
chentong0 / factoid-wiki
Dense X Retrieval: What Retrieval Granularity Should We Use?
☆159Updated last year
sail-sg / sailcraft
🚢 Data Toolkit for Sailor Language Models
☆94Updated 5 months ago
voidism / Lookback-Lens
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆130Updated 11 months ago
SalesforceAIResearch / SFR-RAG
☆77Updated 6 months ago
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆233Updated 9 months ago
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆160Updated last year
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆242Updated 8 months ago
hyintell / RetrievalQA
Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…
☆67Updated last year
facebookresearch / CRAG
Comprehensive benchmark for RAG
☆204Updated last month
zorazrw / filco
[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton
☆193Updated last year
dwzhu-pku / LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆139Updated 8 months ago
spcl / MRAG
Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
☆222Updated last month
shengliu66 / ICV
Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
☆182Updated 5 months ago
DataArcTech / LLM-as-a-Judge
☆128Updated 4 months ago
tianyi-lab / Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
☆360Updated 11 months ago
zorazrw / awesome-tool-llm
☆237Updated 11 months ago
AIR-Bench / AIR-Bench
[ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
☆150Updated last week
DaoD / INTERS
This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"
☆204Updated 7 months ago
davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆291Updated 3 weeks ago
TIGER-AI-Lab / LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
☆235Updated 11 months ago