liujch1998 / GKPLinks
☆99Updated 2 years ago
Alternatives and similar repositories for GKP
Users that are interested in GKP are comparing it to the libraries listed below
Sorting:
- Official Code for "PPT: Pre-trained Prompt Tuning for Few-shot Learning". ACL 2022☆110Updated 3 years ago
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆59Updated 2 years ago
- Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments☆78Updated 6 months ago
- ☆88Updated 2 years ago
- A large-scale complex question answering evaluation of ChatGPT and similar large-language models☆40Updated last year
- Code for "Small Models are Valuable Plug-ins for Large Language Models"☆131Updated 2 years ago
- Do Large Language Models Know What They Don’t Know?☆101Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆257Updated 2 years ago
- ☆141Updated 2 years ago
- [NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"☆112Updated 2 years ago
- Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models"☆61Updated last year
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆144Updated last year
- Official repo for ACL 2023 paper Code4Struct: Code Generation for Few-Shot Structured Prediction from Natural Language.☆43Updated last year
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆51Updated 2 years ago
- Paper collections of retrieval-based (augmented) language model.☆232Updated last year
- ☆43Updated 2 years ago
- paper list on reasoning in NLP☆194Updated 7 months ago
- Official code for "Continual Prompt Tuning for Dialog State Tracking" (ACL 2022).☆27Updated 2 years ago
- ☆177Updated last year
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆83Updated 2 years ago
- ☆294Updated last year
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆74Updated 3 years ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆48Updated last year
- ☆64Updated 2 years ago
- ☆46Updated last year
- Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"☆58Updated 3 years ago
- ☆32Updated last year
- RARR: Researching and Revising What Language Models Say, Using Language Models☆49Updated 2 years ago
- This respository contains the code for extracting the test samples we used in our paper: "A Multitask, Multilingual, Multimodal Evaluatio…☆80Updated last year
- ☆28Updated last year