xufangzhi / GeniusLinks
[ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework
☆68Updated 2 months ago
Alternatives and similar repositories for Genius
Users that are interested in Genius are comparing it to the libraries listed below
Sorting:
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆104Updated 2 weeks ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated 3 weeks ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆104Updated 2 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆32Updated 3 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆51Updated 2 months ago
- ☆47Updated 5 months ago
- ARM: Adaptive Reasoning Model☆45Updated 2 weeks ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"☆134Updated 2 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆58Updated 9 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 6 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆47Updated 3 months ago
- ☆53Updated 2 months ago
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆161Updated last month
- [Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.☆59Updated this week
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆120Updated last month
- Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆96Updated last month
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆52Updated 2 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆76Updated 4 months ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆107Updated 2 months ago
- ☆26Updated 2 weeks ago
- ☆48Updated 3 months ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆78Updated 4 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning☆72Updated 8 months ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆69Updated 3 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆43Updated 3 weeks ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆105Updated 2 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆176Updated this week
- A repo for open research on building large reasoning models☆87Updated this week
- Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆108Updated 2 weeks ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆63Updated 4 months ago