THU-KEG / KoLALinks
[ICLR24] The open-source repo of THU-KEG's KoLA benchmark.
☆51Updated 2 years ago
Alternatives and similar repositories for KoLA
Users that are interested in KoLA are comparing it to the libraries listed below
Sorting:
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Updated 2 years ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆82Updated 2 years ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆59Updated 2 years ago
- Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments☆78Updated 5 months ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆62Updated 2 years ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆41Updated 2 years ago
- Do Large Language Models Know What They Don’t Know?☆99Updated 11 months ago
- Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks☆99Updated 2 years ago
- Towards Systematic Measurement for Long Text Quality☆36Updated last year
- ☆141Updated 2 years ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆139Updated 5 months ago
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.☆132Updated 2 years ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆42Updated last year
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)☆49Updated last year
- ☆46Updated last year
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆32Updated 2 years ago
- Repository for Decomposed Prompting☆93Updated last year
- ☆88Updated 2 years ago
- self-adaptive in-context learning☆45Updated 2 years ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆50Updated 2 years ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆69Updated 5 months ago
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"☆47Updated 2 years ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆83Updated last year
- Official repo for ACL 2023 paper Code4Struct: Code Generation for Few-Shot Structured Prediction from Natural Language.☆40Updated last year
- ☆17Updated 7 months ago
- ☆67Updated 3 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year
- Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models"☆61Updated last year
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆66Updated 2 years ago
- Released code for our ICLR23 paper.☆66Updated 2 years ago