dmis-lab / ETHIC
[NAACL 2025] ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
☆14 Updated last month
Alternatives and similar repositories for ETHIC:
Users who are interested in ETHIC are comparing it to the repositories listed below.
- [ICLR 2025] ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains ☆13 Updated this week
- [EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering ☆20 Updated 5 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆84 Updated 3 months ago
- 🌲 Code for our EMNLP 2023 paper - 🎄 "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode… ☆48 Updated last year
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers". ☆22 Updated 5 months ago
- Dataset and Evaluation Code for the K-QA Benchmark. ☆14 Updated 8 months ago
- Official codebase for permutation self-consistency. ☆16 Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆42 Updated 2 months ago
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks… ☆37 Updated 2 months ago
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards ☆48 Updated 9 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut… ☆21 Updated 2 months ago
- Official repository for ListT5 ☆43 Updated last week
- Learning from Negative samples for Biomedical Generative Entity Linking ☆17 Updated 5 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators" ☆34 Updated 2 months ago
- [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆47 Updated 2 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆65 Updated 10 months ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval ☆30 Updated last year
- ☆65 Updated last year
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models ☆43 Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆14 Updated 2 months ago
- ☆17 Updated last year
- Official repository of FactKG ☆55 Updated last week
- Code and data for the FACTOR paper ☆44 Updated last year
- [WWW 2024] The official repo for the paper "Scalable and Effective Generative Information Retrieval". ☆54 Updated 9 months ago
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc… ☆31 Updated 8 months ago
- Code and data for "Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation" (EMNLP 2023… ☆30 Updated 9 months ago
- This is the official project of the paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver… ☆17 Updated 3 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆72 Updated 2 weeks ago
- AbstainQA, ACL 2024 ☆25 Updated 4 months ago
- ☆23 Updated last year