THU-KEG / KoLA
[ICLR24] The open-source repo of THU-KEG's KoLA benchmark.
☆50 · Updated last year
Alternatives and similar repositories for KoLA:
Users who are interested in KoLA are comparing it to the repositories listed below.
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation" ☆79 · Updated last year
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey" ☆60 · Updated last year
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆35 · Updated last year
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context" ☆65 · Updated 5 months ago
- A framework for editing the CoTs for better factuality ☆47 · Updated last year
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024) ☆46 · Updated 8 months ago
- ☆38 · Updated last year
- ☆39 · Updated last year
- ☆60 · Updated 6 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆65 · Updated 2 weeks ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆64 · Updated last month
- ☆85 · Updated last year
- Towards Systematic Measurement for Long Text Quality ☆31 · Updated 4 months ago
- ☆28 · Updated last year
- ☆31 · Updated last year
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆136 · Updated 6 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback ☆39 · Updated 10 months ago
- Data and baseline code of EMNLP 2021 paper "MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset". ☆23 · Updated 3 years ago
- [NeurIPS 2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory ☆58 · Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆64 · Updated 9 months ago
- Code for our EMNLP 2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks" ☆24 · Updated last year
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39 · Updated last year
- ☆60 · Updated 2 years ago
- Do Large Language Models Know What They Don’t Know? ☆88 · Updated 2 months ago
- Repo for ACL 2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models" ☆59 · Updated 9 months ago
- LogiQA 2.0 dataset: logical reasoning in MRC and NLI tasks ☆84 · Updated last year
- Repo for ACL 2023 paper "Won't Get Fooled Again: Answering Questions with False Premises" ☆21 · Updated last year
- Self-adaptive in-context learning ☆42 · Updated last year
- [ICML 2024] Can AI Assistants Know What They Don't Know? ☆76 · Updated 11 months ago
- Code and data for NeurIPS 2021 paper "A Dataset for Answering Time-Sensitive Questions" ☆66 · Updated 2 years ago