Lichang-Chen / InstructZeroLinks

Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts!

☆195

Alternatives and similar repositories for InstructZero

Users that are interested in InstructZero are comparing it to the libraries listed below

Sorting:

google / sycophancy-intervention
Scripts for generating synthetic finetuning data for reducing sycophancy.
☆113Updated last year
lz1oceani / verify_cot
☆134Updated last year
bhargaviparanjape / language-programmes
☆172Updated 2 years ago
msclar / formatspread
Code accompanying "How I learned to start worrying about prompt formatting".
☆108Updated 2 months ago
declare-lab / flacuna
Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…
☆111Updated last year
QingruZhang / PASTA
PASTA: Post-hoc Attention Steering for LLMs
☆122Updated 8 months ago
uclaml / Rephrase-and-Respond
Official repo of Respond-and-Respond: data, code, and evaluation
☆103Updated last year
reasoning-machines / prompt-lib
A set of utilities for running few-shot prompting experiments on large-language models
☆122Updated last year
yueyu1030 / AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
☆152Updated last year
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆163Updated 3 months ago
Re-Align / URIAL
☆311Updated last year
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆219Updated last year
neelsjain / BYOD
The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"
☆107Updated last year
anchen1011 / FireAct
FireAct: Toward Language Agent Fine-tuning
☆281Updated last year
veronica320 / Faithful-COT
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
☆162Updated last year
FranxYao / GPT-Bargaining
Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback
☆208Updated 2 years ago
oriyor / reasoning-on-cots
Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"
☆96Updated last year
microsoft / simulated-trial-and-error
☆122Updated last year
rxlqn / awesome-llm-self-reflection
augmented LLM with self reflection
☆128Updated last year
wenhuchen / TheoremQA
The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset
☆159Updated last year
Anni-Zou / Meta-CoT
Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
☆97Updated last year
neulab / gemini-benchmark
☆149Updated last year
Re-Align / just-eval
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
☆85Updated last year
princeton-nlp / intercode
[NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898
☆223Updated last year
jayelm / gisting
Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467
☆289Updated 5 months ago
dwzhu-pku / LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆139Updated 9 months ago
HazyResearch / TART
TART: A plug-and-play Transformer module for task-agnostic reasoning
☆200Updated 2 years ago
ryoungj / ToolEmu
[ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use
☆152Updated last year
shizhediao / active-prompt
Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models"
☆243Updated last year
Gentopia-AI / Gentopia
Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.
☆320Updated last year