bruno686/VisPlay

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bruno686/VisPlay)

bruno686 / VisPlay

[CVPR'26] VisPlay: Self-Evolving Vision-Language Models

☆63

Alternatives and similar repositories for VisPlay

Users that are interested in VisPlay are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mbzuai-oryx / EvoLMM
View on GitHub
Self Evolving Large Multimodal Models with Continuous Rewards
☆25Jun 9, 2026Updated last month
zli12321 / Vision-SR1
View on GitHub
Reinforcement Learning of Vision Language Models with Self Visual Perception Reward
☆175Mar 14, 2026Updated 4 months ago
zli12321 / MM-Zero
View on GitHub
Self-evolving vision language models from zero data
☆77Mar 14, 2026Updated 4 months ago
Chengsong-Huang / R-Zero
View on GitHub
[ICLR2026] codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
☆824Feb 4, 2026Updated 5 months ago
ErikZ719 / CoTA
View on GitHub
[ICLR 26] Context Tokens are Anchors: Understanding the Repeat Curse in dMLLMs from an Information Flow Perspective
☆16Mar 6, 2026Updated 4 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
EvolvingLMMs-Lab / MGPO
View on GitHub
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning
☆55Jul 23, 2025Updated 11 months ago
XMUDeepLIT / TTCS
View on GitHub
The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.
☆50Apr 22, 2026Updated 2 months ago
Qwen-Applications / SSP
View on GitHub
Search Self-Play: Pushing the Frontier of Agent Capability without Supervision
☆20Dec 30, 2025Updated 6 months ago
VisionOPD / Vision-OPD
View on GitHub
Vision-OPD is a regional-to-global on-policy self-distillation framework that transfers a model's own privileged crop-conditioned percept…
☆197Updated this week
YujunZhou / EVOL-RL
View on GitHub
Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).
☆51Mar 31, 2026Updated 3 months ago
Hongyang-Du / awesome-3d-datasets
View on GitHub
[CVPRW'26] A collection and survey of 3d dataset
☆33Jun 4, 2026Updated last month
HJYao00 / R1-ShareVL
View on GitHub
[NeurIPS 2025] Reasoning MLLM, Share-GRPO, advantage vanishing, sparse reward
☆38Sep 19, 2025Updated 10 months ago
Hongyang-Du / VideoGPA
View on GitHub
[ICML'26] VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.
☆70Jun 6, 2026Updated last month
mm-vl / ULM-R1
View on GitHub
Co-Reinforcement Learning for Unified Multimodal Understanding and Generation
☆48Jul 22, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Feilong607 / FarSight
View on GitHub
Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding (CVPR 2025 Oral)
☆42Nov 28, 2025Updated 7 months ago
latentcraft / replay
View on GitHub
[CVPR 2026] Boosting Reasoning in Large Multimodal Models via Activation Replay
☆24May 7, 2026Updated 2 months ago
zli12321 / VideoHallu
View on GitHub
Synthetic Video hallucination and Mitigation
☆23Sep 21, 2025Updated 10 months ago
Visual-Agent / DeepEyes
View on GitHub
☆1,250Nov 20, 2025Updated 8 months ago
zhaochen0110 / OpenThinkIMG
View on GitHub
OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.
☆399Jun 1, 2025Updated last year
GeWu-Lab / MokA
View on GitHub
MokA: Multimodal Low-Rank Adaptation for MLLMs
☆91Dec 30, 2025Updated 6 months ago
ZJU-REAL / SpatialLadder
View on GitHub
[ICLR 2026] SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models
☆99Jun 9, 2026Updated last month
TIGER-AI-Lab / ABC
View on GitHub
ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]
☆19Aug 21, 2025Updated 11 months ago
lose4578 / CircleRoPE
View on GitHub
☆15Sep 1, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
chang-jl / EfficientFlow
View on GitHub
EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI
☆25Jan 17, 2026Updated 6 months ago
AMAP-ML / CoEvolve
View on GitHub
CoEvolve: Training LLM Agents via Agent-Data Mutual Evolution
☆19Apr 27, 2026Updated 2 months ago
heliossun / LaCoT
View on GitHub
[NeurIPS 2025] Official code for paper: Latent Chain-of-Thought for Visual Reasoning
☆36Oct 16, 2025Updated 9 months ago
Chengsong-Huang / G-Zero
View on GitHub
☆25May 14, 2026Updated 2 months ago
Philip-MIT / rover-vlm
View on GitHub
☆18Dec 1, 2025Updated 7 months ago
xing0047 / cca-llava
View on GitHub
[NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention
☆67Aug 30, 2025Updated 10 months ago
ziplab / CoV
View on GitHub
[ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning
☆63Apr 7, 2026Updated 3 months ago
ZJU-REAL / SpatialEvo
View on GitHub
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments
☆81Apr 16, 2026Updated 3 months ago
wuxiyang1996 / COS-PLAY
View on GitHub
COS-PLAY: Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Game Play
☆29Jul 11, 2026Updated last week
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
KarnaYip / C2RoPE
View on GitHub
[ICRA 26] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning
☆27Feb 13, 2026Updated 5 months ago
vlf-silkie / VLFeedback
View on GitHub
☆102Dec 22, 2023Updated 2 years ago
lgxi24 / AdaBlock-dLLM
View on GitHub
[ICLR 2026] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
☆15Jan 28, 2026Updated 5 months ago
wantbook-book / SeRL
View on GitHub
SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data
☆24Jan 24, 2026Updated 5 months ago
XiaoyuXu-Vincent / step-saliency
View on GitHub
Official code for paper "Reasoning Fails Where Step Flow Breaks" (ACL 2026)
☆18Apr 19, 2026Updated 3 months ago
StevenZHB / CoT_Causal_Analysis
View on GitHub
Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"
☆23Feb 19, 2025Updated last year
zhengkid / Parallel-R1
View on GitHub
The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"
☆260Feb 4, 2026Updated 5 months ago