sfasfaffa / DLPO
Official Code For: {DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective}
☆9 Updated 3 months ago
Alternatives and similar repositories for DLPO
Users that are interested in DLPO are comparing it to the libraries listed below
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆163 Updated last month
- ☆281 Updated last month
- ICLR 2025 Agent-Related Papers ☆71 Updated 8 months ago
- SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation ☆38 Updated last week
- ☆246 Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆106 Updated 5 months ago
- ☆186 Updated this week
- ☆148 Updated last week
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet ☆130 Updated last month
- Official Repository of "Learning to Reason under Off-Policy Guidance" ☆254 Updated this week
- A tiny reproduction of DeepSeek R1 Zero on two A100s. ☆71 Updated 5 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning. ☆263 Updated last week
- ☆320 Updated last month
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation. ☆168 Updated this week
- ☆148 Updated 2 months ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆204 Updated this week
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆134 Updated last week
- Awesome RL-based LLM Reasoning ☆561 Updated 2 months ago
- ☆274 Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆228 Updated 2 months ago
- Awesome Agent Training ☆188 Updated 2 weeks ago
- ☆149 Updated 2 months ago
- ☆25 Updated last month
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language … ☆85 Updated 2 months ago
- CycleResearcher: Improving Automated Research via Automated Review ☆210 Updated last week
- Latest Advances on Long Chain-of-Thought Reasoning ☆432 Updated last week
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond ☆268 Updated 2 weeks ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in… ☆622 Updated this week
- ☆18 Updated last month
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper) ☆284 Updated 2 weeks ago