google-deepmind / oproLinks
official code for "Large Language Models as Optimizers"
☆650Updated 10 months ago
Alternatives and similar repositories for opro
Users that are interested in opro are comparing it to the libraries listed below
Sorting:
- Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.☆251Updated last year
 - Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models☆379Updated last year
 - Official implementation of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers☆186Updated last month
 - [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"☆437Updated 3 weeks ago
 - ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆477Updated 6 months ago
 - List of language agents based on paper "Cognitive Architectures for Language Agents"☆1,056Updated 9 months ago
 - [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆797Updated last year
 - An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆356Updated last year
 - A library for advanced large language model reasoning☆2,295Updated 4 months ago
 - [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆668Updated 4 months ago
 - A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆892Updated last month
 - This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen…☆333Updated 3 months ago
 - Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆409Updated last year
 - RewardBench: the first evaluation tool for reward models.☆646Updated 4 months ago
 - LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.☆753Updated last year
 - A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.☆381Updated 2 years ago
 - Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"☆303Updated last year
 - Official repository for ORPO☆463Updated last year
 - Code for Quiet-STaR☆739Updated last year
 - An extensible benchmark for evaluating large language models on planning☆426Updated last month
 - [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆424Updated last year
 - AIDE: AI-Driven Exploration in the Space of Code. The machine Learning engineering agent that automates AI R&D.☆1,064Updated last month
 - This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆556Updated last year
 - Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆214Updated 2 years ago
 - Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆175Updated last year
 - Representation Engineering: A Top-Down Approach to AI Transparency☆903Updated last year
 - papers related to LLM-agent that published on top conferences☆320Updated 6 months ago
 - SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆316Updated last year
 - ☆311Updated last year
 - Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆636Updated last month