opendatalab / REST
☆31 Updated 3 months ago
Alternatives and similar repositories for REST
Users who are interested in REST are comparing it to the repositories listed below.
- ☆43 Updated 2 weeks ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆71 Updated 4 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆58 Updated 7 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆23 Updated 2 months ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆27 Updated last month
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆27 Updated 2 weeks ago
- ☆36 Updated 2 weeks ago
- ☆50 Updated 3 months ago
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner. ☆33 Updated last week
- ☆29 Updated last month
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning ☆41 Updated this week
- Official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆36 Updated 9 months ago
- DCPO: Dynamic Adaptive Clipping for RL ☆37 Updated last month
- Geometric-Mean Policy Optimization ☆86 Updated last week
- ☆38 Updated 2 months ago
- ☆15 Updated 4 months ago
- ☆51 Updated 3 months ago
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model ☆56 Updated 2 weeks ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆65 Updated 5 months ago
- [NeurIPS 2025] Code for Let LLMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604 ☆49 Updated 3 weeks ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models ☆38 Updated last month
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning ☆31 Updated 2 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments ☆41 Updated last month
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025). ☆10 Updated last month
- ☆45 Updated 3 weeks ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training ☆39 Updated 2 months ago
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning" ☆24 Updated last week
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM" ☆16 Updated last week
- AgenTracer: A Lightweight Failure Attributor for Agentic Systems ☆55 Updated last month
- SSRL: Self-Search Reinforcement Learning ☆147 Updated 2 months ago