dmis-lab / ETHIC
[NAACL 2025] ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage
☆15 Updated last month
Alternatives and similar repositories for ETHIC
Users who are interested in ETHIC are comparing it to the repositories listed below.
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers". ☆22 Updated last year
- 🌲 Code for our EMNLP 2023 paper - "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode… ☆52 Updated last year
- [ICLR 2025] ChroKnowledge: Unveiling Chronological Knowledge of Language Models in Multiple Domains ☆16 Updated 7 months ago
- Official codebase for permutation self-consistency. ☆18 Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆93 Updated 11 months ago
- official repository for ListT5 ☆48 Updated 8 months ago
- Code and data for the FACTOR paper ☆52 Updated last year
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence ☆19 Updated last month
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆77 Updated last year
- ☆75 Updated last year
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding" ☆31 Updated 10 months ago
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks… ☆41 Updated 10 months ago
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings) ☆16 Updated last year
- ☆85 Updated 9 months ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023) ☆22 Updated 2 years ago
- ☆19 Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't… ☆121 Updated last year
- ☆20 Updated last year
- ☆19 Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆59 Updated last year
- ☆17 Updated 2 years ago
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge ☆28 Updated last year
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models ☆47 Updated last year
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages ☆37 Updated 2 months ago
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)" ☆54 Updated last year
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation" ☆39 Updated last year
- Code base of In-Context Learning for Dialogue State tracking ☆45 Updated 2 years ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆51 Updated 2 months ago
- ☆31 Updated last year
- ☆74 Updated last year