opendatalab / RESTLinks
☆28Updated last month
Alternatives and similar repositories for REST
Users that are interested in REST are comparing it to the libraries listed below
Sorting:
- ☆25Updated last week
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆68Updated 2 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆27Updated last week
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆58Updated 5 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆39Updated last week
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆54Updated 2 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆20Updated 2 weeks ago
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆161Updated last month
- JudgeLRM: Large Reasoning Models as a Judge☆32Updated 4 months ago
- ☆79Updated 4 months ago
- Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆23Updated 2 months ago
- ☆109Updated last week
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆20Updated 3 weeks ago
- ☆44Updated 3 weeks ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆53Updated 2 months ago
- ☆15Updated 2 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 7 months ago
- ☆50Updated last month
- Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"☆40Updated 3 months ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆36Updated 2 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆62Updated 3 months ago
- ☆21Updated 2 months ago
- ☆18Updated last month
- ARM: Adaptive Reasoning Model☆46Updated 3 weeks ago
- ☆87Updated this week
- ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆108Updated last month
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆81Updated 2 months ago
- ☆22Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆38Updated 2 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆122Updated last month