google-deepmind / opro
Official code for "Large Language Models as Optimizers"
☆445 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for opro
- Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models ☆237 · Updated 7 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆745 · Updated this week
- Official repository for ORPO ☆421 · Updated 5 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ☆471 · Updated 4 months ago
- RewardBench: the first evaluation tool for reward models. ☆437 · Updated last month
- Code for Quiet-STaR ☆654 · Updated 3 months ago
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding ☆328 · Updated 9 months ago
- Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs. ☆224 · Updated 8 months ago
- ☆1,385 · Updated this week
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents" ☆253 · Updated last month
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆627 · Updated last month
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate ☆355 · Updated last year
- List of language agents based on paper "Cognitive Architectures for Language Agents" ☆785 · Updated 2 months ago
- ☆940 · Updated 2 weeks ago
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆798 · Updated 2 months ago
- ☆529 · Updated 2 months ago
- A library for advanced large language model reasoning ☆1,457 · Updated last week
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆886 · Updated last month
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆535 · Updated 3 weeks ago
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models" ☆685 · Updated 3 months ago
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback. ☆438 · Updated 3 weeks ago
- Automated Evaluation of RAG Systems ☆486 · Updated 2 weeks ago
- An Analytical Evaluation Board of Multi-turn LLM Agents ☆250 · Updated 6 months ago
- A curated list of awesome LLM agents. ☆516 · Updated 2 weeks ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆523 · Updated 3 weeks ago
- Official implementation of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers ☆106 · Updated 5 months ago
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,048 · Updated 6 months ago
- Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them ☆427 · Updated 4 months ago
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. ☆1,847 · Updated 3 weeks ago
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s… ☆495 · Updated 2 weeks ago