opendatalab / RESTLinks
☆30Updated 2 months ago
Alternatives and similar repositories for REST
Users that are interested in REST are comparing it to the libraries listed below
Sorting:
- ☆33Updated 2 weeks ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆32Updated last month
- ☆22Updated this week
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 3 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆96Updated 5 months ago
- Towards a Unified View of Large Language Model Post-Training☆111Updated last week
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆58Updated 6 months ago
- ☆42Updated last month
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆165Updated 2 months ago
- ☆51Updated 2 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated last month
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆22Updated last month
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆20Updated last month
- ☆15Updated 3 months ago
- ☆35Updated last month
- ☆127Updated 2 weeks ago
- ☆35Updated last month
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25☆60Updated 2 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆36Updated 5 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆41Updated last month
- ☆45Updated this week
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆64Updated 4 months ago
- Segment Policy Optimization: Improved Credit Assignment in Reinforcement Learning for LLMs☆32Updated last month
- Geometric-Mean Policy Optimization☆74Updated last month
- ☆17Updated 9 months ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆38Updated last week
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆45Updated 2 months ago
- ARM: Adaptive Reasoning Model☆47Updated last month
- ☆22Updated last year
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆172Updated last month