Lichang-Chen / InstructZero
Official implementation of InstructZero, the first framework to optimize bad prompts of ChatGPT (API LLMs) and finally obtain good prompts!
☆189Updated 7 months ago
Alternatives and similar repositories for InstructZero:
Users who are interested in InstructZero are comparing it to the repositories listed below.
- ☆130Updated last year
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆279Updated 3 weeks ago
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated 7 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆218Updated 4 months ago
- ☆305Updated 9 months ago
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback☆205Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆157Updated 10 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆227Updated 3 weeks ago
- PASTA: Post-hoc Attention Steering for LLMs☆113Updated 3 months ago
- FireAct: Toward Language Agent Fine-tuning☆271Updated last year
- ☆81Updated last year
- Simple next-token-prediction for RLHF☆222Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆108Updated last year
- Self-Alignment with Principle-Following Reward Models☆156Updated last year
- ☆120Updated 9 months ago
- ☆172Updated last year
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use☆130Updated 11 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆180Updated 7 months ago
- ☆167Updated last year
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆102Updated 3 weeks ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆233Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆81Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆288Updated 9 months ago
- ☆77Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆130Updated 4 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆102Updated 5 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆84Updated last year
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆214Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆218Updated last year