hengzzzhou / ReSo
☆13 · Updated last month
Alternatives and similar repositories for ReSo
Users who are interested in ReSo are comparing it to the repositories listed below
- ☆38 · Updated this week
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models ☆20 · Updated 5 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025) ☆26 · Updated 5 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World". ☆26 · Updated last month
- ☆42 · Updated 2 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" ☆90 · Updated last week
- Multimodal RewardBench ☆39 · Updated 2 months ago
- Natural Language Reinforcement Learning ☆87 · Updated 4 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆56 · Updated 2 months ago
- ☆45 · Updated 3 months ago
- ☆20 · Updated 6 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆77 · Updated 3 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆35 · Updated 3 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ☆48 · Updated 6 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025. ☆21 · Updated 2 months ago
- ☆75 · Updated 4 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning? ☆25 · Updated last month
- ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration ☆34 · Updated 4 months ago
- ☆41 · Updated 4 months ago
- ☆40 · Updated 6 months ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks ☆73 · Updated last month
- ☆10 · Updated 3 weeks ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆72 · Updated 3 weeks ago
- ☆21 · Updated 2 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆21 · Updated 10 months ago
- ☆14 · Updated 4 months ago
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning ☆34 · Updated this week
- [ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision ☆62 · Updated 10 months ago
- ☆38 · Updated last month
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆43 · Updated 2 months ago