Moocember / Optimization-by-PROmpting

☆78

Alternatives and similar repositories for Optimization-by-PROmpting

Users who are interested in Optimization-by-PROmpting are comparing it to the libraries listed below.

Anni-Zou / Meta-CoT
Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
☆99 Updated 2 years ago
SALT-NLP / demonstrated-feedback
☆129 Updated last year
LLM360 / Analysis360
Open Implementations of LLM Analyses
☆107 Updated last year
google / sycophancy-intervention
Scripts for generating synthetic finetuning data for reducing sycophancy.
☆117 Updated 2 years ago
uclaml / Rephrase-and-Respond
Official repo of Rephrase-and-Respond: data, code, and evaluation
☆104 Updated last year
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆123 Updated last year
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆84 Updated last year
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)
☆72 Updated last year
microsoft / simulated-trial-and-error
☆122 Updated last year
declare-lab / flacuna
Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…
☆111 Updated 2 years ago
neulab / gemini-benchmark
☆150 Updated last year
salesforce / BOLAA
☆185 Updated 10 months ago
kyegomez / phi-1
Plug-in-and-play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation
☆74 Updated 2 years ago
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆63 Updated last year
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆169 Updated 2 months ago
Lichang-Chen / InstructZero
Official implementation of InstructZero, the first framework to optimize bad prompts of ChatGPT (API LLMs) and finally obtain good prompts…
☆197 Updated last year
austrian-code-wizard / c3po
☆29 Updated 4 months ago
lz1oceani / verify_cot
☆136 Updated 2 years ago
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79 Updated last year
Anni-Zou / DocBench
DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems
☆59 Updated last year
thomasgauthier / LLM-self-play
Minimal implementation of the "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" paper (arXiv 2401.01335)
☆29 Updated last year
locuslab / scaling_laws_data_filtering
☆65 Updated last year
18907305772 / FuseAI
FuseAI Project
☆87 Updated 10 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆91 Updated last year
architsharma97 / dpo-rlaif
☆100 Updated last year
allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆93 Updated last year
yueyu1030 / AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
☆156 Updated 2 years ago
jakespringer / echo-embeddings
☆157 Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆115 Updated 9 months ago
dvlab-research / MR-GSM8K
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
☆51 Updated last year