ZiyuGuo99/ATLAS

ZiyuGuo99 / ATLAS

One Discrete Word for Visual Reasoning Overtakes Agentic and Latent Methods

☆137

Alternatives and similar repositories for ATLAS

Users who are interested in ATLAS are comparing it to the repositories listed below.

ZiyuGuo99 / MME-CoF
Are Video Models Ready as Zero-shot Reasoners?
☆87 · Nov 24, 2025 · Updated 8 months ago
ZiyuGuo99 / Thinking-while-Generating
The first interleaved framework for textual reasoning within the visual generation process
☆165 · Mar 16, 2026 · Updated 4 months ago
NOVAglow646 / Monet
[CVPR 2026] Official code for "Monet: Reasoning in Latent Visual Space Beyond Image and Language"
☆215 · Mar 19, 2026 · Updated 4 months ago
thuml / Reasoning-Visual-World
Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…
☆100 · Mar 9, 2026 · Updated 4 months ago
VincentLeebang / lvr
Official codebase for the paper "Latent Visual Reasoning"
☆171 · Oct 22, 2025 · Updated 9 months ago
XD111ds / ILVR
[ACL'26 Oral] Interleaved Latent Visual Reasoning with Selective Perceptual Modeling
☆66 · May 29, 2026 · Updated 2 months ago
CYWang735 / AdaTooler-V
☆72 · Feb 27, 2026 · Updated 5 months ago
xinyan-cxy / MINT-CoT
[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
☆107 · Sep 19, 2025 · Updated 10 months ago
hwanyu112 / Latent-Sketchpad
☆73 · Feb 1, 2026 · Updated 5 months ago
UMass-Embodied-AGI / Mirage
[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
☆294 · Aug 2, 2025 · Updated 11 months ago
hyperai / tvm-cn
TVM documentation in Simplified Chinese (TVM 中文文档)
☆3,868 · May 20, 2026 · Updated 2 months ago
ThinkMorph / ThinkMorph
[ICLR 2026] The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
☆192 · May 1, 2026 · Updated 2 months ago
Fugtemypt123 / ToolVQA-release
Codebase for the paper "ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools"
☆31 · Nov 3, 2025 · Updated 8 months ago
multimodal-reasoning-lab / Bagel-Zebra-CoT
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆137 · Jan 30, 2026 · Updated 5 months ago
Hunyuan-PromptEnhancer / PromptEnhancer
[CVPR 2026] PromptEnhancer is a prompt-rewriting tool that refines prompts into clearer, structured versions for better image generation.
☆3,739 · Jun 10, 2026 · Updated last month
ybb6 / laser
☆35 · Apr 22, 2026 · Updated 3 months ago
UCSB-AI / DMLR
[CVPR 2026] Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"
☆85 · May 12, 2026 · Updated 2 months ago
WildDataX / suppr-zotero-plugin
Translate PDF, Word, PowerPoint, etc. | Zotero translation plugin. Register by scanning a WeChat QR code; new users can translate 250,000 Chinese characters or 1,000,000 English letters for free. Official site of Suppr (超能文献): suppr.wilddata.cn
☆2,008 · Jun 24, 2026 · Updated last month
ZhuoyangLiu2005 / last0
[ICML 2026] LaST$_0$: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model
☆87 · Apr 30, 2026 · Updated 2 months ago
xid32 / SoundMind
We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…
☆1,110 · Nov 26, 2025 · Updated 8 months ago
xinyan-cxy / OpenCoF
OpenCoF: Learning to Reason Through Video Generation
☆73 · Jul 10, 2026 · Updated 2 weeks ago
FYYDCC / IVT-LR
Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”
☆18 · Jan 27, 2026 · Updated 6 months ago
microsoft / TimeCraft
Official code for TimeCraft: A Time Series Generation Framework for Real-World Applications
☆1,082 · Feb 12, 2026 · Updated 5 months ago
bytedance / UniVR
☆29 · Updated this week
ZiyuGuo99 / Image-Generation-CoT
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
☆865 · Mar 19, 2026 · Updated 4 months ago
inclusionAI / Zooming-without-Zooming
[ICML 2026] ZwZ model family: SOTA fine-grained perception performance; ZoomBench: a new challenging perception benchmark
☆179 · May 4, 2026 · Updated 2 months ago
ZhenhaoPeng / WhiteLagoon
☆48 · Apr 14, 2025 · Updated last year
FullAgent / fulling
Fulling is an AI-powered Full-stack Engineer Agent. Built with Next.js, Claude, shadcn/ui, and PostgreSQL. Uses Kubernetes as infra.
☆2,434 · Updated this week
fim-ai / fim-one
Open-source agent platform for Global × China enterprises: wire every system through one agent core. Self-hosted, any LLM.
☆1,372 · Updated this week
arctanxarc / GENIUS
☆43 · May 9, 2026 · Updated 2 months ago
xiongsi2000 / web-embed-chatbot
☆40 · Jun 10, 2025 · Updated last year
phoenix-zhou / goshop
This project is a B2C e-commerce platform built on the Golang Gin framework. It uses an MVC (Model-View-Controller) architecture for modular design, can be extended to separate the front end from the back end, and supports back-office product management, a user system, order transactions, payment integration, data analytics, and other features, systematically…
☆1,069 · Oct 12, 2025 · Updated 9 months ago
EvolvingLMMs-Lab / ParaVT
ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
☆54 · Jun 2, 2026 · Updated last month
limix-ldm-ai / LimiX
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence https://arxiv.org/abs/2509.03505
☆3,852 · Jun 16, 2026 · Updated last month
MME-Benchmarks / MME-CoT
MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency
☆136 · Aug 5, 2025 · Updated 11 months ago
PKU-Alignment / align-anything
Align Anything: Training All-modality Model with Feedback
☆4,664 · Nov 27, 2025 · Updated 8 months ago
Klavis-AI / klavis
Klavis AI: MCP integration platforms that let AI agents use tools reliably at any scale
☆5,779 · Jun 1, 2026 · Updated last month
Kail-Fu / InterviewOS
Replace coding puzzles with real-work simulations.
☆1,906 · Jul 10, 2026 · Updated 2 weeks ago
jy1993 / SimpleRL
☆240 · Dec 13, 2025 · Updated 7 months ago