Osilly/Vision-DeepResearch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Osilly/Vision-DeepResearch)

Osilly / Vision-DeepResearch

[ICML 2026] Multimodal deep-research MLLM and benchmark. The first long-horizon multimodal deep-research MLLM, extending the number of reasoning turns to dozens and the number of search-engine interactions to hundreds.

☆656

Alternatives and similar repositories for Vision-DeepResearch

Users that are interested in Vision-DeepResearch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

suimuc / VIRES
View on GitHub
☆342Jul 4, 2025Updated last year
ByteDance-Seed / EvaLearn
View on GitHub
EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…
☆431May 12, 2026Updated 2 months ago
ZinYY / TreeLoRA
View on GitHub
A pytorch implementation of the paper "TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Simi…
☆350Dec 15, 2025Updated 7 months ago
yixinzhang98 / otc_med_chat_agent
View on GitHub
An AI-powered conversational agent for recommending over-the-counter medications based on user symptoms and needs. Built with Python and …
☆198Jul 29, 2025Updated 11 months ago
GenerTeam / GENERanno
View on GitHub
GENERanno: A Genomic Foundation Model for Metagenomic Annotation
☆314Jun 15, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JyAether / Aether
View on GitHub
☆389May 5, 2025Updated last year
GabePersson / EmoVision
View on GitHub
☆590Oct 11, 2025Updated 9 months ago
yixinzhang98 / causal_inference_uplift_toolkits
View on GitHub
☆155Nov 14, 2025Updated 8 months ago
HKUDS / SepLLM
View on GitHub
[ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"
☆571Jul 29, 2025Updated 11 months ago
WXY604 / LLM-based-causal-discovery
View on GitHub
☆837Jul 7, 2025Updated last year
renxh4 / CompressPng
View on GitHub
☆405Aug 31, 2022Updated 3 years ago
GaohaoZhou-ops / JetsonYoloROS
View on GitHub
This repository implements Yolo functionality using TensorRT and CUDA acceleration on Nvidia Jetson devices and the ROS framework.
☆205Aug 14, 2025Updated 11 months ago
AlgRUC / JittorGeometric
View on GitHub
JittorGeometric is a Jittor-based graph machine learning library.
☆1,078Jun 3, 2026Updated last month
Din829 / DbRheo-CLI
View on GitHub
A database operations and data analysis AI agent
☆432Aug 31, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xid32 / SoundMind
View on GitHub
We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…
☆1,110Nov 26, 2025Updated 7 months ago
CoderLineChan / SwiftlyUI
View on GitHub
UIKit Plus: Infusing SwiftUI-like Development Efficiency. Revolutionizing UIKit development through chain syntax, resultBuilder, and mode…
☆261Apr 15, 2026Updated 3 months ago
xxiaouw / SteamSmartBuy
View on GitHub
An intelligent Steam deal analytics dashboard leveraging Python, MySQL, and Power BI to surface the most worthwhile discounts.
☆156Jul 8, 2025Updated last year
jackdark425 / aigroupapp
View on GitHub
AI Group is a powerful mobile intelligent assistant application that integrates multiple large language models (LLMs) and AI services, pr…
☆1,100Sep 10, 2025Updated 10 months ago
Lerwee / Perseus
View on GitHub
采集管家
☆814Jan 20, 2026Updated 6 months ago
lachlanchen / OpenHI
View on GitHub
Self‑calibrated neuromorphic hyperspectral imaging pipeline for event cameras with diffractive illumination. Includes end‑to‑end tools f…
☆103Apr 27, 2026Updated 2 months ago
hzlab / Brain-Harmony
View on GitHub
Official codebase for "Brain Harmony: A Multimodal Foundation Model Unifying Morphology and Function into 1D Tokens" (NeruIPS 2025).
☆243Oct 26, 2025Updated 8 months ago
OTA-Tech-AI / web-agent-protocol
View on GitHub
🌐Web Agent Protocol (WAP) - Record and replay user interactions in the browser with MCP support
☆502Jun 19, 2025Updated last year
hyperai / tvm-cn
View on GitHub
TVM Documentation in Chinese Simplified / TVM 中文文档
☆3,854May 20, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lyanlin96 / Application-Security-Ingress-Controller
View on GitHub
☆277Apr 29, 2025Updated last year
wenhaoli-xmu / seco
View on GitHub
☆163Nov 16, 2025Updated 8 months ago
Jiapeng-Pei / LLMSensitiveDataGoverance
View on GitHub
☆286Feb 21, 2026Updated 5 months ago
konmor / konmorreport
View on GitHub
☆114Aug 10, 2025Updated 11 months ago
Hunyuan-PromptEnhancer / PromptEnhancer
View on GitHub
[CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.
☆3,729Jun 10, 2026Updated last month
greatInvoker / 2025-full-stack-tech-sharing
View on GitHub
2025技术分享（FullStack Frontend Focus），分享常用知识点。代码纯手打+AI验证，只做精品！！！
☆153Jul 2, 2025Updated last year
THESIS-AGENT / AIRouter
View on GitHub
🚀 AIRouter - 智能AI路由器：为多个LLM提供商提供统一API接口，支持负载均衡、故障转移和智能路由 | Intelligent AI Router with unified API interface, load balancing, and smart r…
☆179Aug 28, 2025Updated 10 months ago
kelvinfkr / adaptive-strategies-for-climate-change-adaptation-An-application-for-flood-risk-management
View on GitHub
data and codes for adaptive strategies for climate change adaptation: An application for flood risk management
☆134Feb 13, 2025Updated last year
NVlabs / MDP
View on GitHub
[CVPR 2025] MDP: Multidimensional Vision Model Pruning with Latency Constraint
☆169Sep 10, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HiGoalV / HiGoalVita
View on GitHub
HiGoalVita is a modular, layered, production ready AI RAG suite.
☆252May 22, 2025Updated last year
ximinng / SVGDreamerV2
View on GitHub
[T-PAMI 2025] Official implementation for "SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation" https://arxiv…
☆451Dec 13, 2024Updated last year
THESIS-AGENT / thesis-agent-demo
View on GitHub
一个基于多个大语言模型的智能学术范文写作系统，能够根据输入的开题报告或研究设计文档，自动生成包含引用的学术范文的各章节内容。
☆260Jul 14, 2025Updated last year
ModelEngine-Group / app-platform
View on GitHub
AppPlatform 是一个前沿的大模型应用工程，旨在通过集成的声明式编程和低代码配置工具，简化和优化大模型的训练与推理应用的开发过程。本工程为软件工程师和产品经理提供一个强大的、可扩展的环境，以支持从概念到部署的全流程 AI 应用开发。
☆1,446May 18, 2026Updated 2 months ago
WYKwong / LoLTrackGuard
View on GitHub
☆149Apr 2, 2026Updated 3 months ago
360CVGroup / WISA
View on GitHub
World Simulator Assistant for Physics-Aware Text-to-Video Generation
☆276Sep 22, 2025Updated 9 months ago
rainbowyuyu / manim_extend_rainbow
View on GitHub
Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …
☆206Dec 15, 2025Updated 7 months ago