google-deepmind / opro
official code for "Large Language Models as Optimizers"
☆439Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for opro
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆738Updated last week
- Code for Quiet-STaR☆646Updated 2 months ago
- RewardBench: the first evaluation tool for reward models.☆428Updated 3 weeks ago
- Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models☆229Updated 7 months ago
- Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.☆225Updated 7 months ago
- Official repository for ORPO☆419Updated 5 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,554Updated 2 months ago
- ☆920Updated last week
- This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.☆432Updated 2 weeks ago
- Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality s…☆480Updated last week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆492Updated last week
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆678Updated 3 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents☆245Updated 5 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆705Updated last week
- ☆493Updated 3 weeks ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆349Updated last year
- A library for advanced large language model reasoning☆1,420Updated 2 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models☆467Updated 4 months ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆428Updated 6 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆882Updated 3 weeks ago
- Generative Representational Instruction Tuning☆562Updated last week
- Official implementation of the paper Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers☆103Updated 5 months ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.☆618Updated last month
- Automatically evaluate your LLMs in Google Colab☆557Updated 6 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆239Updated last year
- A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.☆314Updated last year
- This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen…☆199Updated 3 months ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆781Updated 4 months ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,036Updated 6 months ago
- ☆1,266Updated this week