dmis-lab / ETHICLinks
[NAACL 2025] ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
β15Updated 3 months ago
Alternatives and similar repositories for ETHIC
Users that are interested in ETHIC are comparing it to the libraries listed below
Sorting:
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervisionβ95Updated last year
- π² Code for our EMNLP 2023 paper - π "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Modeβ¦β52Updated 2 years ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".β22Updated last year
- [ICLR 2025] ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domainsβ17Updated 9 months ago
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"β21Updated 3 weeks ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"β79Updated last year
- Enhancing contextual understanding in large language models through contrastive decodingβ21Updated last year
- Official codebase for permutation self-consistency.β18Updated last year
- This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dialβ¦β25Updated last year
- β21Updated last year
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"β53Updated last year
- β17Updated 2 years ago
- Personalized Story Evaluation Modelβ18Updated 2 years ago
- official repository for ListT5β48Updated 2 weeks ago
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)β30Updated last month
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Modelsβ47Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messagesβ52Updated 4 months ago
- First explanation metric (diagnostic report) for text generation evaluationβ62Updated 9 months ago
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"β14Updated 4 months ago
- β39Updated last year
- β76Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don'tβ¦β126Updated last year
- Awesome LLM for NLG Evaluation Papersβ25Updated last year
- [EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answeringβ37Updated last year
- Code and data for the FACTOR paperβ52Updated 2 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utβ¦β23Updated last year
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)β64Updated 2 years ago
- β189Updated 5 months ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.β16Updated 2 years ago
- β89Updated 11 months ago