Go2Heart/OmniStream

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Go2Heart/OmniStream)

Go2Heart / OmniStream

[ECCV 2026] OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams

☆113

Alternatives and similar repositories for OmniStream

Users that are interested in OmniStream are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Go2Heart / StreamFormer
View on GitHub
[ICCV 2025 Oral] Official implementation of Learning Streaming Video Representation via Multitask Training.
☆93Updated this week
qirui-chen / RGA3-release
View on GitHub
[ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring
☆24Aug 8, 2025Updated 11 months ago
zhengrongz / AoTD
View on GitHub
[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".
☆58May 25, 2025Updated last year
THU-SI / Spatial-TTT
View on GitHub
[ECCV 2026] Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
☆238Jun 19, 2026Updated last month
haoningwu3639 / SimpleSDM-Video
View on GitHub
A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers.
☆20Feb 15, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
facebookresearch / DepthLM_Official
View on GitHub
[ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM
☆362Jun 1, 2026Updated last month
cskrren / vggtcore
View on GitHub
A unified framework for feed-forward neural networks
☆15Nov 28, 2025Updated 7 months ago
jbistanbul / universalvtg
View on GitHub
Official Code for the paper "UniversalVTG: A Univeral and Lightweight Foundation Model for Video Temporal Grounding"
☆15Apr 15, 2026Updated 3 months ago
QitaoZhao / E-RayZer
View on GitHub
[CVPR 2026] "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.
☆300May 30, 2026Updated last month
ESI-Bench / ESI-Bench
View on GitHub
☆116Updated this week
Visionary-Laboratory / holi-spatial
View on GitHub
[ICML 2026 Oral] Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence
☆366Jul 6, 2026Updated 2 weeks ago
Luo-Yihang / 4RC
View on GitHub
[ICML 2026] 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere
☆213Jul 7, 2026Updated last week
yangzhou24 / OmniWorld
View on GitHub
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆485Apr 16, 2026Updated 3 months ago
Lzq5 / UniTime
View on GitHub
Universal Video Temporal Grounding with Generative Multi-modal Large Language Models
☆56May 20, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
cg1177 / Recursive-Multimodal-Agent
View on GitHub
☆19Jul 1, 2026Updated 2 weeks ago
qirui-chen / MultiHop-EgoQA
View on GitHub
[AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
☆38May 27, 2025Updated last year
eldar / vdpm
View on GitHub
Official implementation of Video-DPM
☆242Jan 19, 2026Updated 6 months ago
YkiWu / Point3R
View on GitHub
[NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
☆191Mar 10, 2026Updated 4 months ago
NIRVANALAN / STream3R
View on GitHub
Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]
☆392May 8, 2026Updated 2 months ago
Becomebright / ReKV
View on GitHub
[ICLR'25] Streaming Video Question-Answering with In-context Video KV-Cache Retrieval
☆121Nov 4, 2025Updated 8 months ago
haoningwu3639 / SimpleSDM-3
View on GitHub
A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.
☆27May 28, 2025Updated last year
JaceyHuang / Gen3R
View on GitHub
[CVPR 2026] Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction
☆362Mar 20, 2026Updated 4 months ago
Any-4D / Any4D
View on GitHub
Any4D: Unified Feed-Forward Metric 4D Reconstruction
☆382Apr 17, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
VITA-Group / VLM-3R
View on GitHub
[CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
☆428Updated this week
Becomebright / GroundVQA
View on GitHub
Official PyTorch code of GroundVQA (CVPR'24)
☆63Sep 13, 2024Updated last year
facebookresearch / lagernvs
View on GitHub
Official code for "LagerNVS Latent Geometry for Fully Neural Real-time Novel View Synthesis" (CVPR 2026)
☆402Jun 26, 2026Updated 3 weeks ago
THU-SI / Spatial-MLLM
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆479Feb 5, 2026Updated 5 months ago
LaVi-Lab / VG-LLM
View on GitHub
The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
☆245Nov 28, 2025Updated 7 months ago
GVCLab / MLLM-4D
View on GitHub
[ICML 2026] MLLM-4D: Towards Visual-based Spatial-Temporal Intelligence
☆36May 1, 2026Updated 2 months ago
jzr99 / Geo4D
View on GitHub
[ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
☆437Jun 6, 2025Updated last year
zyrant / FI3Det
View on GitHub
[CVPR 2026] Few-Shot Incremental 3D Object Detection in Dynamic Indoor Environments
☆16Apr 10, 2026Updated 3 months ago
Yangr116 / VST
View on GitHub
[ECCV2026] Visual Spatial Tuning
☆198Mar 25, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
hustvl / Spa3R
View on GitHub
Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning
☆51Mar 25, 2026Updated 3 months ago
mll-lab-nu / ViewAgent
View on GitHub
☆20Jul 3, 2026Updated 2 weeks ago
facebookresearch / cowtracker
View on GitHub
CoWTracker: Tracking by Warping instead of Correlation
☆171Feb 5, 2026Updated 5 months ago
InternRobotics / G2VLM
View on GitHub
[CVPR 2026] G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
☆345Apr 18, 2026Updated 3 months ago
IamCreateAI / NeoVerse
View on GitHub
[CVPR 2026 Highlight & Best Paper of VideoWorldModel Workshop] NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
☆638May 12, 2026Updated 2 months ago
TencentARC / MotionCrafter
View on GitHub
[CVPR 2026 Highlight🔥] MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
☆173Jun 11, 2026Updated last month
henry123-boy / SpaTrackerV2
View on GitHub
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆984Feb 27, 2026Updated 4 months ago