OpenSenseNova/SenseNova-MARS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenSenseNova/SenseNova-MARS)

OpenSenseNova / SenseNova-MARS

☆122

Alternatives and similar repositories for SenseNova-MARS

Users that are interested in SenseNova-MARS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SenseTime-FVG / InteractiveOmni
View on GitHub
☆24Dec 3, 2025Updated 7 months ago
Mini-o3 / Mini-o3
View on GitHub
Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"
☆423Jan 29, 2026Updated 6 months ago
ls-kelvin / REVPT
View on GitHub
Code for paper: Reinforced Vision Perception with Tools
☆74Oct 3, 2025Updated 9 months ago
Visual-Agent / DeepEyesV2
View on GitHub
☆625Feb 26, 2026Updated 5 months ago
mbzuai-oryx / Video-CoM
View on GitHub
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
☆22Jun 17, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
OpenSenseNova / SenseNova-SI
View on GitHub
[CVPR 2026] Scaling Spatial Intelligence with Multimodal Foundation Models
☆290May 14, 2026Updated 2 months ago
ChenShawn / MultiModal-Jupyter-Sandbox
View on GitHub
Simple code sandbox supporting jupyter notebook style code execution. Used for agent training
☆25Dec 5, 2025Updated 7 months ago
daeunni / StreamGaze
View on GitHub
Code for "StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos"
☆27May 13, 2026Updated 2 months ago
njucckevin / OpenMobile-Code
View on GitHub
The model, data and code for OpenMobile
☆50Jul 9, 2026Updated 2 weeks ago
EvolvingLMMs-Lab / multimodal-search-r1
View on GitHub
[ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…
☆470Apr 7, 2026Updated 3 months ago
lcqysl / FrameThinker
View on GitHub
[ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"
☆50Oct 9, 2025Updated 9 months ago
Liac-li / MM-self-improve-qwen2vl
View on GitHub
☆13Dec 9, 2024Updated last year
microsoft / PixelCraft
View on GitHub
[ICLR 2026] High-Fidelity Visual Reasoning on Structured Images
☆30Jul 17, 2026Updated last week
IVUL-KAUST / VideoAuto-R1
View on GitHub
[CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
☆88Feb 27, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
TIGER-AI-Lab / Pixel-Reasoner
View on GitHub
Pixel-Level Reasoning Model trained with RL [NeuIPS25]
☆301Jun 4, 2026Updated last month
CYWang735 / AdaTooler-V
View on GitHub
☆72Feb 27, 2026Updated 5 months ago
RenlyH / CodeV
View on GitHub
[CVPR 2026 Oral] Code with Image
☆31Dec 5, 2025Updated 7 months ago
AIoT-MLSys-Lab / MMDeepResearch-Bench
View on GitHub
MMDeepResearch-Bench (MMDR)
☆31Apr 1, 2026Updated 3 months ago
DingWu1021 / Promsa
View on GitHub
[ECCV 2026] Promsa: Progressive Multimodal Search Agents for Knowledge-Based Visual Question Answering
☆91Jul 7, 2026Updated 3 weeks ago
Osilly / Vision-DeepResearch
View on GitHub
[ICML 2026] Multimodal deep-research MLLM and benchmark. The first long-horizon multimodal deep-research MLLM, extending the number of re…
☆658Jun 8, 2026Updated last month
EvolvingLMMs-Lab / LongVT
View on GitHub
[CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
☆257Jun 24, 2026Updated last month
ByteDance-BandAI / CodeVision
View on GitHub
[CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images
☆71Jan 23, 2026Updated 6 months ago
mlvlab / DeepVideoR1
View on GitHub
[NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"
☆37Feb 22, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
shawn0728 / OpenSearch-VL
View on GitHub
🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diver…
☆258May 19, 2026Updated 2 months ago
Lucanyc / VISTA-Gym
View on GitHub
☆27Mar 17, 2026Updated 4 months ago
iSEE-Laboratory / Seg-ReSearch
View on GitHub
(ICML 2026) Seg-ReSearch: Segmentation with Interleaved Reasoning and External Search
☆49May 1, 2026Updated 2 months ago
1benwu1 / VSRM-Efficient-LRMs
View on GitHub
Official repo of Promoting Efficient Reasoning with Verifiable Stepwise Reward
☆16Sep 9, 2025Updated 10 months ago
VoyageWang / VG-Refiner
View on GitHub
The repository of VG-Refiner paper
☆20Dec 9, 2025Updated 7 months ago
multimodal-art-projection / IV-Bench
View on GitHub
☆14Apr 23, 2025Updated last year
BytedanceDouyinContent / SAIL-VL2
View on GitHub
The SAIL-VL2 series model developed by the BytedanceDouyinContent Group
☆79Sep 18, 2025Updated 10 months ago
njucckevin / MM-Self-Improve
View on GitHub
A Self-Training Framework for Vision-Language Reasoning
☆90Jan 23, 2025Updated last year
leaves162 / CLIPtrase
View on GitHub
cliptrase
☆47Sep 1, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
VisionOPD / Vision-OPD
View on GitHub
Vision-OPD is a regional-to-global on-policy self-distillation framework that transfers a model's own privileged crop-conditioned percept…
☆214Jul 17, 2026Updated last week
inclusionAI / Zooming-without-Zooming
View on GitHub
[ICML 2026] ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark
☆179May 4, 2026Updated 2 months ago
We-Math / V-Thinker
View on GitHub
☆177Nov 26, 2025Updated 8 months ago
Visual-Agent / DeepEyes
View on GitHub
☆1,251Nov 20, 2025Updated 8 months ago
egolife-ai / Ego-R1
View on GitHub
[TPAMI 2026] Ego-R1: Agentic Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning
☆165Jun 10, 2026Updated last month
xyq7 / Human-Contribution-Measurement
View on GitHub
☆13Jun 4, 2025Updated last year
ventr1c / Awesome-RL-based-Agentic-Search-Papers
View on GitHub
The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Eva…
☆280Jul 21, 2026Updated last week