[ICLR26] ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding
☆42Mar 20, 2026Updated this week
Alternatives and similar repositories for thinkomni
Users that are interested in thinkomni are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code repository of Shuffle-R1☆25Feb 23, 2026Updated last month
- Towards Generalizable Robotic Manipulation in Dynamic Environments☆34Updated this week
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆55Apr 9, 2025Updated 11 months ago
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆13Apr 12, 2024Updated last year
- VisuRiddles: Fine-grained Perception is a important thing for Multimodal Large Models in Riddles Solving☆18Oct 22, 2025Updated 5 months ago
- ☆16Apr 21, 2025Updated 11 months ago
- [ICCV 2025] LIRA☆21Nov 25, 2025Updated 3 months ago
- 🚀2026年波场TRX靓号地址生成器,USDT钱包靓号生成器,利用 gpu 进行加速,代码开源,安全可靠。TRON vanity address generator, use GPU, opensource, safety, enjoy.☆201Jan 24, 2026Updated last month
- ☆24Feb 27, 2026Updated 3 weeks ago
- ☆107Feb 5, 2026Updated last month
- Awesome GPT-4 with Applications. This is a collection of resources related to GPT-4, including news, official documents, demo and applica…☆20Mar 15, 2023Updated 3 years ago
- The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static t…☆546Mar 3, 2026Updated 2 weeks ago
- MCP server for browsing, searching, and exporting Cursor AI chat history.☆29Mar 8, 2026Updated 2 weeks ago
- Unity Asset Store Auto Downloader☆84Updated this week
- 基于 Python 与 DeepSeek 的《地平线4》AI 语音助手,支持实时遥测数据驱动的智能副驾交互。☆22Feb 20, 2026Updated last month
- A telegram communication bot written in golang.☆212Mar 12, 2026Updated last week
- ArcticDB-backed time series cache with incremental updates — fetch once, upsert the gap.☆57Updated this week
- 14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero L…☆1,837Updated this week
- ☆33Dec 1, 2025Updated 3 months ago
- The Chongqing University Bituminous Pavement Disease Detection Dataset (CQU-BPDD)☆13Apr 17, 2025Updated 11 months ago
- ☆121Jan 18, 2026Updated 2 months ago
- the official code of DriveMonkey☆45Updated this week
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation☆240Jul 14, 2025Updated 8 months ago
- Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization (DynaMO) - Official Implementation☆88Mar 10, 2026Updated last week
- Youtu-RAG: Next-Generation Agentic Intelligent Retrieval-Augmented Generation System☆232Mar 5, 2026Updated 2 weeks ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆360Dec 18, 2025Updated 3 months ago
- ☆41Mar 6, 2026Updated 2 weeks ago
- ☆56Oct 3, 2024Updated last year
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]☆185Mar 12, 2026Updated last week
- ☆51Feb 13, 2026Updated last month
- Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching☆291Aug 29, 2025Updated 6 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆776Feb 2, 2026Updated last month
- ☆31Jun 14, 2024Updated last year
- A fully modular, framework-agnostic, easy-to-extend SDK for building complex X402 payment integrations.☆52Updated this week
- DataCompare is a Java-based tool designed to verify the consistency of data after replication or migration operations are completed betwe…☆199Mar 2, 2026Updated 3 weeks ago
- [ICRA 2026] UniFuture: A 4D Driving World Model for Future Generation and Perception☆147Feb 26, 2026Updated 3 weeks ago
- The code for the paper "Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders" (AAAI'24).☆37Dec 26, 2023Updated 2 years ago
- An implementation of MSSRM method☆11Mar 23, 2023Updated 3 years ago
- 2D-MUSIC to estimate time-of-flight and angle-of-arrival in simulated radar data.☆109Jan 21, 2026Updated 2 months ago