opendatalab / RESTLinks
☆28Updated 2 weeks ago
Alternatives and similar repositories for REST
Users that are interested in REST are comparing it to the libraries listed below
Sorting:
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated 3 weeks ago
- ☆19Updated this week
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆20Updated last week
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆94Updated last week
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 6 months ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆29Updated last month
- Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆20Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆66Updated last month
- ☆50Updated 2 weeks ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Updated 4 months ago
- ☆77Updated 3 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆51Updated 3 weeks ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated 2 weeks ago
- ☆15Updated 10 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆79Updated 2 months ago
- ☆32Updated 3 months ago
- ☆20Updated last month
- Efficient Agent Training for Computer Use☆117Updated last month
- Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"☆37Updated 2 months ago
- Code for Let LLMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆43Updated this week
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆19Updated last week
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆31Updated last month
- ☆57Updated last week
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆158Updated 3 weeks ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆63Updated 3 months ago
- LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆19Updated 4 months ago
- ☆76Updated 3 weeks ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆61Updated 2 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆38Updated 9 months ago
- ARM: Adaptive Reasoning Model☆45Updated last month