dependentsign / Awesome-LLM-based-Evaluators
✨✨Latest Papers about LLM-based Evaluators
☆28Updated 11 months ago
Alternatives and similar repositories for Awesome-LLM-based-Evaluators:
Users that are interested in Awesome-LLM-based-Evaluators are comparing it to the libraries listed below
- Awesome LLM for NLG Evaluation Papers☆23Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆166Updated 3 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆109Updated 8 months ago
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering☆43Updated 3 weeks ago
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Updated last year
- Source code of our paper MIND, ACL 2024 Long Paper☆39Updated 10 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆110Updated 6 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆198Updated 9 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆453Updated last year
- ☆68Updated 3 months ago
- ☆90Updated last week
- ☆277Updated last year
- Scaling Sentence Embeddings with Large Language Models☆104Updated last year
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked…☆159Updated 8 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆67Updated 11 months ago
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆242Updated 2 years ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆80Updated last month
- RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.☆124Updated 9 months ago
- A Survey on Data Selection for Language Models☆219Updated 5 months ago
- ☆69Updated last year
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆124Updated 8 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆147Updated 6 months ago
- ☆83Updated 5 months ago
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data"☆30Updated 5 months ago
- Benchmarking library for RAG☆181Updated last week
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆124Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆107Updated 6 months ago
- ☆52Updated 7 months ago
- A Survey of Attributions for Large Language Models☆197Updated 7 months ago
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models☆96Updated 7 months ago