Fu-Dayuan / PreActLinks
PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)
☆30Updated 11 months ago
Alternatives and similar repositories for PreAct
Users that are interested in PreAct are comparing it to the libraries listed below
Sorting:
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆147Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆86Updated 6 months ago
- ☆46Updated 5 months ago
- Sotopia-RL: Reward Design for Social Intelligence☆44Updated 3 months ago
- ☆51Updated 9 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆78Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Updated 3 months ago
- ☆53Updated 9 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆51Updated 5 months ago
- Natural Language Reinforcement Learning☆100Updated 4 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Updated last year
- ☆20Updated 3 months ago
- ☆30Updated last year
- ☆65Updated 5 months ago
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆27Updated last year
- This the implementation of LeCo☆31Updated 10 months ago
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆60Updated last month
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆135Updated 2 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆112Updated 10 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆108Updated 9 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆52Updated 3 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆58Updated 5 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆102Updated last month
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆109Updated 6 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆62Updated 10 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆112Updated 4 months ago
- ☆42Updated last year
- ☆46Updated 2 months ago