Shenzhi-Wang / reconLinks
The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)
☆13Updated last year
Alternatives and similar repositories for recon
Users that are interested in recon are comparing it to the libraries listed below
Sorting:
- ☆24Updated last year
- official implementation of paper "Process Reward Model with Q-value Rankings"☆65Updated 10 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆113Updated last year
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆21Updated 4 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Updated 11 months ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆109Updated 6 months ago
- ☆53Updated 9 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆21Updated last year
- ☆46Updated 6 months ago
- Natural Language Reinforcement Learning☆100Updated 4 months ago
- ☆51Updated 9 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆44Updated 3 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆60Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- The official implementation of Self-Exploring Language Models (SELM)☆63Updated last year
- ☆33Updated 6 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆47Updated last year
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)☆73Updated 3 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 7 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆47Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆113Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆136Updated last year
- ☆72Updated last month
- ☆69Updated 5 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆139Updated 2 months ago
- FuseAI Project☆87Updated 10 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆62Updated 11 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆112Updated 4 months ago
- ☆105Updated last year