XD111ds/ILVR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/XD111ds/ILVR)

XD111ds / ILVR

[ACL'26 Oral] Interleaved Latent Visual Reasoning with Selective Perceptual Modeling

☆65

Alternatives and similar repositories for ILVR

Users that are interested in ILVR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

VincentLeebang / lvr
View on GitHub
Official codebase for the paper Latent Visual Reasoning
☆170Oct 22, 2025Updated 8 months ago
Svardfox / LaViT
View on GitHub
Official codebase for the paper LaViT
☆34Feb 15, 2026Updated 5 months ago
ybb6 / laser
View on GitHub
☆34Apr 22, 2026Updated 2 months ago
TungChintao / SkiLa
View on GitHub
Official codes of "Sketch-in-Latents: Eliciting Unified Reasoning in MLLMs"
☆17Feb 15, 2026Updated 5 months ago
NOVAglow646 / Monet
View on GitHub
[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"
☆207Mar 19, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
UMass-Embodied-AGI / Mirage
View on GitHub
[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
☆293Aug 2, 2025Updated 11 months ago
Maplebb / LoMo
View on GitHub
Offline implementation of LoMo: Local Modality Substitution for Deeper Vision-Language Fusion.
☆25Jun 1, 2026Updated last month
EnigmaYYYY / SocialClaw
View on GitHub
SocialClaw is a screen-aware social copilot that watches live chat windows, builds personalized memory and profile context, and suggests …
☆40Apr 9, 2026Updated 3 months ago
UCSB-AI / DMLR
View on GitHub
[CVPR2026] Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"
☆84May 12, 2026Updated 2 months ago
hwanyu112 / Latent-Sketchpad
View on GitHub
☆73Feb 1, 2026Updated 5 months ago
heliossun / LaCoT
View on GitHub
[NeurIPS 2025] Official code for paper: Latent Chain-of-Thought for Visual Reasoning
☆36Oct 16, 2025Updated 9 months ago
FanmengWang / ReGuLaR
View on GitHub
The official implementation of “ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought”
☆53Feb 2, 2026Updated 5 months ago
ZiyuGuo99 / ATLAS
View on GitHub
One Discrete Word for Visual Reasoning Overtakes Agentic and Latent Methods
☆137Jun 9, 2026Updated last month
LaVi-Lab / Rethink_CoT_Video
View on GitHub
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21Dec 14, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ALEX-nlp / DenoiseRL
View on GitHub
DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes
☆36Updated this week
CYWang735 / AdaTooler-V
View on GitHub
☆71Feb 27, 2026Updated 4 months ago
Shredded-Pork / Flash-GRPO
View on GitHub
[ICML 2026] Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
☆58Jun 11, 2026Updated last month
daniel-cores / tvbench
View on GitHub
TVBench: Redesigning Video-Language Evaluation
☆15Jun 9, 2025Updated last year
CFinTech / SparseSSM
View on GitHub
[arxiv 2025] SparseSSM: Efficient Selective Structured State Space Models Can Be Pruned in One-Shot
☆21Oct 8, 2025Updated 9 months ago
HumanMLLM / IRG-MotionLLM
View on GitHub
(ECCV2026) Official repository of paper "IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Gene…
☆30Jul 1, 2026Updated 2 weeks ago
WxxShirley / Agent-STAR
View on GitHub
Official implementation for paper "Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe"
☆32May 12, 2026Updated 2 months ago
Hanhpt23 / OmniMod
View on GitHub
MCOUT: Multimodal Chain of Continuous Thought for Latent Reasoning
☆21Oct 4, 2025Updated 9 months ago
SPIRAL-MED / MedMCP-Calc
View on GitHub
☆23Jun 28, 2026Updated 3 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Koreyoshi01 / VISD
View on GitHub
This repository is the official implementation for VISD.
☆21May 17, 2026Updated 2 months ago
THUNLP-MT / Brote
View on GitHub
☆11Jan 19, 2025Updated last year
inclusionAI / Zooming-without-Zooming
View on GitHub
[ICML 2026] ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark
☆174May 4, 2026Updated 2 months ago
xlyu0106 / Awesome-Latent-Space
View on GitHub
A paper list of Awesome Latent Space.
☆946Jul 13, 2026Updated last week
shiyi-zh0408 / NAE_CVPR2024
View on GitHub
[CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction
☆43May 16, 2024Updated 2 years ago
Accio-Lab / SwimBird
View on GitHub
☆18Apr 9, 2026Updated 3 months ago
ltpo2025 / LTPO
View on GitHub
[ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization
☆32Mar 6, 2026Updated 4 months ago
redorangeyellowy / AttentionHand
View on GitHub
Official Pytorch implementation for "AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild…
☆12May 11, 2026Updated 2 months ago
s-vco / s-vco
View on GitHub
Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images
☆19Jun 4, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
spatigen / milr
View on GitHub
Official code of paper: MILR: Improving Multimodal Image Generation via Test-Time Latent Reasoning
☆18Feb 12, 2026Updated 5 months ago
WorkerAmo / wenquanshuju-pdf-downloader
View on GitHub
针对文泉书局已购买版权内容下载JS脚本（原文泉学堂改）
☆15Mar 11, 2020Updated 6 years ago
iSEE-Laboratory / Revisting_FSCIL
View on GitHub
Official PyTorch implementation of our ECCV2024 paper “Rethinking Few-shot Class-incremental Learning: Learning from Yourself”
☆21Jan 12, 2025Updated last year
wendell0218 / Janus-Pro-R1
View on GitHub
[NeurIPS 2025] Official repository of the paper "Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Compreh…
☆23Sep 27, 2025Updated 9 months ago
TencentBAC / RoT
View on GitHub
[ACL 2026] Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning
☆93Jan 22, 2026Updated 5 months ago
xiaomi-research / colar
View on GitHub
[NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
☆97Jun 29, 2026Updated 3 weeks ago
xinyan-cxy / MINT-CoT
View on GitHub
[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
☆107Sep 19, 2025Updated 10 months ago