yongchao98 / PROMSTLinks

Automatic prompt optimization framework for multi-step agent tasks.

☆32

Alternatives and similar repositories for PROMST

Users that are interested in PROMST are comparing it to the libraries listed below

Sorting:

thu-coai / SPaR
☆47Updated last month
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33Updated last year
McGill-NLP / agent-reward-bench
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
☆29Updated last week
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆35Updated 10 months ago
Open-Source-O1 / o1_Reasoning_Patterns_Study
☆103Updated 7 months ago
GAIR-NLP / ReAlign
Reformatted Alignment
☆113Updated 10 months ago
fairyshine / Seal-Tools
The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…
☆52Updated 8 months ago
GAIR-NLP / OPO
☆50Updated last year
THUDM / LongReward
☆56Updated 9 months ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58Updated last year
matthewrenze / self-reflection
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
☆78Updated 8 months ago
cambridgeltl / PairS
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)
☆47Updated 6 months ago
shizhediao / Post-Training-Data-Flywheel
We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.
☆57Updated 9 months ago
DAMO-NLP-SG / contrastive-cot
Contrastive Chain-of-Thought Prompting
☆66Updated last year
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆51Updated last month
icip-cas / SSO
A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…
☆20Updated 8 months ago
feiyang-k / AutoScale
Official Code Repository for [AutoScale–Automatic Prediction of Compute-optimal Data Compositions for Training LLMs]
☆12Updated 6 months ago
meowpass / FollowComplexInstruction
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆50Updated last year
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆38Updated 5 months ago
swt-user / DMPO
☆43Updated 9 months ago
FlagOpen / Infinity-Instruct
☆48Updated last year
SalesforceAIResearch / FoFo
☆25Updated 6 months ago
ictnlp / LevelRAG
The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…
☆37Updated 3 months ago
lichengliu03 / unary-feedback
☆19Updated this week
tianyi-lab / MoE-Embedding
Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆76Updated 9 months ago
yegcjs / mixinglaws
☆103Updated 2 weeks ago
OFA-Sys / DiverseEvol
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
☆83Updated last year
princeton-pli / MeCo
Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
☆41Updated last month
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆49Updated 7 months ago
yale-nlp / refdpo
☆16Updated last year