FYYDCC/IVT-LR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FYYDCC/IVT-LR)

FYYDCC / IVT-LR

Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”

☆18

Alternatives and similar repositories for IVT-LR

Users that are interested in IVT-LR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

VincentLeebang / lvr
View on GitHub
Official codebase for the paper Latent Visual Reasoning
☆169Oct 22, 2025Updated 8 months ago
Ruiyang-061X / Awesome-MLLM-Reasoning
View on GitHub
📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.
☆13Feb 7, 2025Updated last year
DoubtedSteam / MM-GCoT
View on GitHub
The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"
☆22Jul 21, 2025Updated last year
gogoczh / CoMT
View on GitHub
code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"
☆19Mar 10, 2025Updated last year
UMass-Embodied-AGI / Mirage
View on GitHub
[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
☆293Aug 2, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
multimodal-reasoning-lab / Bagel-Zebra-CoT
View on GitHub
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆137Jan 30, 2026Updated 5 months ago
Hanhpt23 / OmniMod
View on GitHub
MCOUT: Multimodal Chain of Continuous Thought for Latent Reasoning
☆21Oct 4, 2025Updated 9 months ago
Shengqi77 / Long-range-Turbulence-Mitigation
View on GitHub
[2024 ECCV] Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework
☆12Jun 13, 2025Updated last year
CodeDance-VL / CodeDance
View on GitHub
☆32Mar 17, 2026Updated 4 months ago
edchengg / VAE_GAN
View on GitHub
VAE+GAN
☆10Apr 18, 2018Updated 8 years ago
UCSB-AI / DMLR
View on GitHub
[CVPR2026] Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"
☆84May 12, 2026Updated 2 months ago
AlbertTan404 / pytorch-open-x-embodiment
View on GitHub
Data pre-processing and training code on Open-X-Embodiment with pytorch
☆11Jan 20, 2025Updated last year
micky-li-hd / CoCo
View on GitHub
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation
☆54Apr 9, 2026Updated 3 months ago
IDEA-Research / V-Reflection
View on GitHub
Related code, checkpoints and project page for V-Reflection
☆60Apr 7, 2026Updated 3 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
JoakimHaurum / ATC
View on GitHub
Official PyTorch implementation of Agglomerative Token Clustering presented at ECCV 2024
☆20Sep 19, 2024Updated last year
xiaowudeshen / CRS-Paper-List
View on GitHub
In this repository, we summary a paper list of works in conversational recommendation system and its related areas.
☆15Sep 19, 2023Updated 2 years ago
120L020904 / ACE
View on GitHub
Official implementation of “ACE: Anti-Editing Concept Erasure in Text-to-Image Models”
☆17Jan 5, 2026Updated 6 months ago
qingyue2014 / MoE4DST
View on GitHub
☆12Jul 18, 2023Updated 3 years ago
ybb6 / laser
View on GitHub
☆34Apr 22, 2026Updated 2 months ago
shawnricecake / Heima
View on GitHub
[ICML 2026] Heima
☆75May 20, 2026Updated 2 months ago
windforfurture / DTCA
View on GitHub
for DTCA model
☆10Oct 17, 2023Updated 2 years ago
wangyu-ustc / LargeScaleWashing
View on GitHub
The official implementation of the paper "Large Scale Knowledge Washing"
☆10Jun 12, 2024Updated 2 years ago
TerminologyHub / termhub-in-5-minutes
View on GitHub
Developer project for getting basic API integrations working in under 5 minutes
☆11May 22, 2026Updated last month
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
suntea233 / DualLoRA
View on GitHub
Implementation of ACL 2024 paper "Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation".
☆15Nov 9, 2024Updated last year
BRZ911 / Wrong-of-Thought
View on GitHub
[EMNLP 2024 Findings] Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information
☆13Oct 1, 2024Updated last year
LesterGong / MMRB
View on GitHub
The official repository of paper "Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark"
☆19Jun 20, 2025Updated last year
aiming-lab / MIRA
View on GitHub
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought
☆31Feb 14, 2026Updated 5 months ago
zgMin / IT-RER-ABSA
View on GitHub
Official implementation for "Instruction Tuning with Retrieval-based Examples Ranking for Aspect-based Sentiment Analysis"
☆13Mar 23, 2026Updated 3 months ago
Liuning-He / Brazil-Conference-Survival-Guide
View on GitHub
A practical bilingual guide to staying safe and prepared at conferences in Brazil / 巴西参会实用攻略与自救指南
☆15Apr 22, 2026Updated 2 months ago
ninibymilk / PMF-MMEA
View on GitHub
[ACL2024] Progressively Modality Freezing for Multi-Modal Entity Alignment
☆19Apr 10, 2025Updated last year
xiaominli1020 / ReNeg
View on GitHub
ReNeg: Learning Negative Embedding with Reward Guidance
☆35Dec 22, 2025Updated 6 months ago
RookieJunChen / HIT-Computer-Network
View on GitHub
哈工大计算机网络课程相关仓库😁
☆16May 19, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yu-rp / NeuralLineage
View on GitHub
Code for CVPR 2024 Oral "Neural Lineage"
☆17Jun 18, 2024Updated 2 years ago
ttw1018 / MoPE-DST
View on GitHub
The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"
☆19Jan 25, 2025Updated last year
AI-Application-and-Integration-Lab / SAM4MLLM
View on GitHub
[ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation
☆51Mar 20, 2025Updated last year
AiHubCN / AiHubCN
View on GitHub
☆11May 1, 2026Updated 2 months ago
lcqysl / DiffThinker
View on GitHub
[ICML 2026] Official repo for "DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models"
☆185Jan 4, 2026Updated 6 months ago
mahtabbigverdi / Aurora-perception
View on GitHub
☆50Feb 18, 2026Updated 5 months ago
ThinkMorph / ThinkMorph
View on GitHub
[ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
☆190May 1, 2026Updated 2 months ago