xinke-wang / LVLM-PlaygroundLinks
[ICLR2025] Are Large Vision Language Models Good Game Players?
☆13Updated 9 months ago
Alternatives and similar repositories for LVLM-Playground
Users that are interested in LVLM-Playground are comparing it to the libraries listed below
Sorting:
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆93Updated 7 months ago
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆58Updated last year
- ☆63Updated last month
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…☆90Updated 3 months ago
- ☆32Updated 4 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆169Updated 6 months ago
- Official Repository of LatentSeek☆70Updated 6 months ago
- ☆112Updated 3 months ago
- ☆111Updated 4 months ago
- [NeurIPS 2025] The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reason…☆150Updated 3 months ago
- ☆55Updated 6 months ago
- The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"☆122Updated 3 weeks ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆64Updated 4 months ago
- [CVPR' 25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Cal…☆20Updated 6 months ago
- Multimodal RewardBench☆55Updated 10 months ago
- The official code repository for the FullFront benchmark☆25Updated 7 months ago
- ☆297Updated 2 months ago
- A Collection of Papers on Diffusion Language Models☆149Updated 3 months ago
- Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)☆59Updated 8 months ago
- Extending context length of visual language models☆12Updated last year
- TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models☆62Updated 3 weeks ago
- Official github repo of G-LLaVA☆148Updated 10 months ago
- The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…☆20Updated this week
- Visual Planning: Let's Think Only with Images☆285Updated 7 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆36Updated 5 months ago
- [NeurIPS 25] The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning☆24Updated 2 months ago
- ☆55Updated 3 months ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆65Updated 3 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆101Updated 3 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆45Updated last week