VincentLeebang/lvr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/VincentLeebang/lvr)

VincentLeebang / lvr

Official codebase for the paper Latent Visual Reasoning

☆172

Alternatives and similar repositories for lvr

Users that are interested in lvr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NOVAglow646 / Monet
View on GitHub
[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"
☆215Mar 19, 2026Updated 4 months ago
UMass-Embodied-AGI / Mirage
View on GitHub
[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
☆294Aug 2, 2025Updated 11 months ago
FYYDCC / IVT-LR
View on GitHub
Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”
☆18Jan 27, 2026Updated 6 months ago
Hanhpt23 / OmniMod
View on GitHub
MCOUT: Multimodal Chain of Continuous Thought for Latent Reasoning
☆21Oct 4, 2025Updated 9 months ago
UCSB-AI / DMLR
View on GitHub
[CVPR2026] Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"
☆85May 12, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Svardfox / LaViT
View on GitHub
Official codebase for the paper LaViT
☆34Feb 15, 2026Updated 5 months ago
XD111ds / ILVR
View on GitHub
[ACL'26 Oral] Interleaved Latent Visual Reasoning with Selective Perceptual Modeling
☆66May 29, 2026Updated 2 months ago
Wakals / CoVT
View on GitHub
[ECCV 2026] Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"
☆385Updated this week
heliossun / LaCoT
View on GitHub
[NeurIPS 2025] Official code for paper: Latent Chain-of-Thought for Visual Reasoning
☆36Oct 16, 2025Updated 9 months ago
ybb6 / laser
View on GitHub
☆35Apr 22, 2026Updated 3 months ago
TungChintao / SkiLa
View on GitHub
Official codes of "Sketch-in-Latents: Eliciting Unified Reasoning in MLLMs"
☆17Feb 15, 2026Updated 5 months ago
hwanyu112 / Latent-Sketchpad
View on GitHub
☆73Feb 1, 2026Updated 5 months ago
ThinkMorph / ThinkMorph
View on GitHub
[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
☆192May 1, 2026Updated 2 months ago
InternLM / SIM-CoT
View on GitHub
[ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"
☆211Apr 13, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xlyu0106 / Awesome-Latent-Space
View on GitHub
A paper list of Awesome Latent Space.
☆950Jul 13, 2026Updated 2 weeks ago
TencentBAC / RoT
View on GitHub
[ACL 2026] Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning
☆93Jan 22, 2026Updated 6 months ago
FanmengWang / ReGuLaR
View on GitHub
The official implementation of “ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought”
☆53Feb 2, 2026Updated 5 months ago
facebookresearch / coconut
View on GitHub
Training Large Language Model to Reason in a Continuous Latent Space
☆1,667Jul 2, 2026Updated 3 weeks ago
DJC-GO-SOLO / Latent-SFT
View on GitHub
Official implementation of Latent-SFT: teaching LLMs to reason with vocabulary-space latent chains.
☆55May 18, 2026Updated 2 months ago
zifuwan / ONLY
View on GitHub
[ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
☆51Jul 7, 2025Updated last year
EIT-NLP / Awesome-Latent-CoT
View on GitHub
This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.
☆366Jun 20, 2026Updated last month
multimodal-reasoning-lab / Bagel-Zebra-CoT
View on GitHub
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆137Jan 30, 2026Updated 5 months ago
ZiyuGuo99 / ATLAS
View on GitHub
One Discrete Word for Visual Reasoning Overtakes Agentic and Latent Methods
☆137Jun 9, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
mahtabbigverdi / Aurora-perception
View on GitHub
☆50Feb 18, 2026Updated 5 months ago
AI9Stars / CapImagine
View on GitHub
[ICML2026] Imagination Helps Visual Reasoning, But Not Yet in Latent Space
☆28May 4, 2026Updated 2 months ago
ModalityDance / Omni-R1
View on GitHub
[ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"
☆63May 26, 2026Updated 2 months ago
ByungKwanLee / Distill-R1
View on GitHub
Open-source RL Framework with Online Teacher-Student Distillation
☆22Mar 5, 2026Updated 4 months ago
Visual-Agent / DeepEyes
View on GitHub
☆1,253Nov 20, 2025Updated 8 months ago
MCG-NJU / RGE
View on GitHub
Reasoning Guided Embeddings: Leveraging MLLM Reasoning for Improved Multimodal Retrieval
☆15Nov 29, 2025Updated 8 months ago
MikeWangWZHL / PAPO
View on GitHub
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
☆153Feb 4, 2026Updated 5 months ago
Mini-o3 / Mini-o3
View on GitHub
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
☆423Jan 29, 2026Updated 6 months ago
multimodal-art-projection / LatentCoT-Horizon
View on GitHub
📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.
☆406Nov 5, 2025Updated 8 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
xinyan-cxy / MINT-CoT
View on GitHub
[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
☆107Sep 19, 2025Updated 10 months ago
xlyu0106 / VisMem
View on GitHub
☆92Feb 5, 2026Updated 5 months ago
saccharomycetes / mllms_know
View on GitHub
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
☆381Apr 20, 2025Updated last year
mlrm-LEAD / mlrm-LEAD
View on GitHub
[CVPR 2026 Highlight] Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding
☆95Apr 9, 2026Updated 3 months ago
zhaochen0110 / Awesome_Think_With_Images
View on GitHub
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…
☆1,499Mar 9, 2026Updated 4 months ago
IDEA-Research / V-Reflection
View on GitHub
Related code, checkpoints and project page for V-Reflection
☆60Apr 7, 2026Updated 3 months ago
Simon98-AI / Vedas
View on GitHub
☆56May 13, 2026Updated 2 months ago