Lichang-Chen / InstructZero
Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts!
☆172Updated last month
Related projects: ⓘ
- ☆284Updated 3 months ago
- FireAct: Toward Language Agent Fine-tuning☆242Updated 10 months ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆188Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation☆92Updated last month
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆131Updated 2 months ago
- RewardBench: the first evaluation tool for reward models.☆352Updated last week
- Self-Alignment with Principle-Following Reward Models☆144Updated 6 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆105Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.☆107Updated 2 weeks ago
- Chain-of-Hindsight, A Scalable RLHF Method☆213Updated 11 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆182Updated last month
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆109Updated last year
- A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use☆106Updated 5 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆130Updated 2 months ago
- Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"☆108Updated 4 months ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆260Updated last year
- Generative Judge for Evaluating Alignment☆208Updated 8 months ago
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"☆303Updated 4 months ago
- ☆121Updated 10 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆133Updated 10 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆201Updated 10 months ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆207Updated last year
- ☆111Updated 3 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆134Updated 6 months ago
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆112Updated last year
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆332Updated 11 months ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆436Updated 3 weeks ago
- PASTA: Post-hoc Attention Steering for LLMs☆96Updated last week
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆84Updated 11 months ago
- ☆166Updated last year