opendatalab / RESTLinks
☆32Updated 4 months ago
Alternatives and similar repositories for REST
Users that are interested in REST are comparing it to the libraries listed below
Sorting:
- ☆51Updated 2 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆70Updated 6 months ago
- ☆53Updated 4 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 2 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆43Updated 2 months ago
- ☆38Updated 3 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Updated 9 months ago
- ☆14Updated 6 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 10 months ago
- ☆36Updated 2 months ago
- Geometric-Mean Policy Optimization☆95Updated 3 weeks ago
- DCPO: Dynamic Adaptive Clipping for RL☆44Updated 2 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆25Updated 4 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆50Updated last month
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆26Updated last month
- SSRL: Self-Search Reinforcement Learning☆158Updated 3 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆27Updated 5 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated this week
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated last month
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆27Updated 2 months ago
- This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions".☆52Updated last week
- [ACL 2025] Knowledge Unlearning for Large Language Models☆47Updated 2 months ago
- ☆62Updated last month
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆230Updated 2 weeks ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆245Updated 2 months ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆42Updated 2 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆51Updated last month
- ☆46Updated 2 months ago
- P1: Mastering Physics Olympiads with Reinforcement Learning☆67Updated 3 weeks ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Updated 5 months ago