Lichang-Chen / InstructZero
Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts!
☆189Updated 7 months ago
Alternatives and similar repositories for InstructZero:
Users that are interested in InstructZero are comparing it to the libraries listed below
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆109Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated 7 months ago
- Self-Alignment with Principle-Following Reward Models☆156Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆151Updated last year
- ☆130Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆157Updated 10 months ago
- ☆306Updated 9 months ago
- ☆81Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆94Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆228Updated last month
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆218Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆102Updated 5 months ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆279Updated last month
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆234Updated last year
- Evaluating LLMs with fewer examples☆147Updated 11 months ago
- Simple next-token-prediction for RLHF☆222Updated last year
- ☆172Updated last year
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use☆130Updated 11 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆113Updated 3 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆218Updated 4 months ago
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models"☆235Updated 10 months ago
- FireAct: Toward Language Agent Fine-tuning☆271Updated last year
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆183Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆136Updated 4 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆81Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- ☆176Updated last month
- ☆178Updated 2 years ago