ofirpress / self-ask
Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"
☆310Updated last year
Alternatives and similar repositories for self-ask:
Users that are interested in self-ask are comparing it to the libraries listed below
- PaL: Program-Aided Language Models (ICML 2023)☆482Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆241Updated last year
- ☆268Updated last year
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models"☆235Updated 10 months ago
- [ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"☆310Updated last year
- A method to fix GPT-3 after deployment with user feedback, without re-training.☆326Updated last year
- Reflexion: an autonomous agent with dynamic memory and self-reflection☆385Updated last year
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆279Updated 3 weeks ago
- ☆258Updated 8 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆254Updated last year
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…☆222Updated last year
- ☆231Updated 2 years ago
- ☆178Updated 2 years ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆157Updated 10 months ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆467Updated last year
- ☆172Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆161Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆208Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆351Updated last year
- Prompt programming with FMs.☆440Updated 7 months ago
- ☆160Updated last year
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆463Updated 2 years ago
- ☆444Updated last year
- ☆275Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆542Updated last year
- Forward-Looking Active REtrieval-augmented generation (FLARE)☆613Updated last year
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts…☆189Updated 7 months ago
- A codebase for "Language Models can Solve Computer Tasks"☆233Updated 10 months ago
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.☆543Updated last year