zoezheng126/Spatio-Temporal-LLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zoezheng126/Spatio-Temporal-LLM)

zoezheng126 / Spatio-Temporal-LLM

☆19

Alternatives and similar repositories for Spatio-Temporal-LLM

Users that are interested in Spatio-Temporal-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

irom-princeton / spine
View on GitHub
Geometry Meets Vision: Revisiting Pretrained Semantics in Distilled Fields
☆32Oct 3, 2025Updated 9 months ago
LaVi-Lab / Video-3D-LLM
View on GitHub
[CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.
☆218Jun 4, 2025Updated last year
neu-vi / struct2d
View on GitHub
Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)
☆31Oct 28, 2025Updated 8 months ago
Haochen-Wang409 / ross3d
View on GitHub
[ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
☆70Jul 22, 2025Updated 11 months ago
PeiwenSun2000 / SpaceVista
View on GitHub
The official repo for SpaceVista: All-Scale Visual Spatial Reasoning from mm to km.
☆43May 26, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
LaVi-Lab / VG-LLM
View on GitHub
The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
☆245Nov 28, 2025Updated 7 months ago
chenyize111 / X2DFD
View on GitHub
☆10Apr 9, 2026Updated 3 months ago
kimren227 / DiffConvex
View on GitHub
☆19Jul 20, 2024Updated 2 years ago
WU-CVGL / GS-Reasoner
View on GitHub
Reasoning in Space via Grounding in the World (ICLR 2025)
☆56Nov 3, 2025Updated 8 months ago
beacon-3d / Beacon3D
View on GitHub
[CVPR 2025] Beacon3D: Object-centric Evaluation for 3D Grounding-QA
☆28Nov 25, 2025Updated 7 months ago
zhoujiahuan1991 / CVPR2025-STOP
View on GitHub
☆19May 8, 2025Updated last year
minghangz / OnVTG
View on GitHub
Online video temporal grounding
☆16Oct 20, 2025Updated 9 months ago
djiajunustc / 3D-LLaVA
View on GitHub
[CVPR 2025] 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer
☆100May 26, 2025Updated last year
Visual-AI / 3DRS
View on GitHub
[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
☆158Dec 9, 2025Updated 7 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
fudan-zvg / UniUGG
View on GitHub
UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding. Accepted to ICLR 2026.
☆63Updated this week
wl-zhao / ivg_network
View on GitHub
☆17Jul 6, 2021Updated 5 years ago
VAISR / OVGGT
View on GitHub
[ECCV 2026] OVGGT is a training-free framework enabling streaming 3D reconstruction from arbitrarily long video with constant memory.
☆46Jul 6, 2026Updated 2 weeks ago
hustvl / Spa3R
View on GitHub
Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning
☆51Mar 25, 2026Updated 3 months ago
MrZihan / g3D-LF
View on GitHub
Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).
☆56Jul 14, 2025Updated last year
Yanbo-23 / OGGSplat
View on GitHub
☆20Jun 11, 2025Updated last year
ZCMax / ScanReason
View on GitHub
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85Oct 10, 2024Updated last year
Rainzor / STAC
View on GitHub
[CVPR 2026 Highlight] Official STAC: Plug-and-Play Spatio-Temporal Aware Cache Compression for Streaming 3D Reconstruction
☆33Jun 15, 2026Updated last month
Sid2697 / HOI-Ref
View on GitHub
Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"
☆30Apr 16, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
HVision-NKU / MaskCLIPpp
View on GitHub
Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"
☆47Mar 25, 2025Updated last year
SunYangtian / UniGeo
View on GitHub
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
☆136Jun 10, 2025Updated last year
MINT-SJTU / STI-Bench
View on GitHub
STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
☆39Jan 12, 2026Updated 6 months ago
gbliao / SPC-GS
View on GitHub
[CVPR25] SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs
☆20Aug 27, 2025Updated 10 months ago
WU-CVGL / SIU3R
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…
☆163Sep 25, 2025Updated 9 months ago
Yanbo-23 / Proto-Comp
View on GitHub
☆19Nov 18, 2024Updated last year
botianzhe / LVLM-DFD
View on GitHub
☆19Feb 10, 2026Updated 5 months ago
facebookresearch / univlg
View on GitHub
Unifying 2D and 3D Vision-Language Understanding
☆126Jul 2, 2026Updated 2 weeks ago
liuxu0303 / EReFormer
View on GitHub
☆27Nov 15, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
InternRobotics / MMSI-Video-Bench
View on GitHub
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
☆60Mar 11, 2026Updated 4 months ago
epic-kitchens / C1-Action-Recognition-TSN-TRN-TSM
View on GitHub
EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM
☆33Mar 15, 2022Updated 4 years ago
3DLLM-Mem / 3DLLM-Mem
View on GitHub
☆27Jun 5, 2025Updated last year
haoningwu3639 / SpatialScore
View on GitHub
[CVPR 2026 Highlight] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence
☆84May 28, 2026Updated last month
zhichengLuxx / GaGS
View on GitHub
[CVPR 2024] 3D Geometry-aware Deformable Gaussian Splatting for Dynamic View Synthesis.
☆22Apr 23, 2025Updated last year
AutoCompSysLab / ContextNav
View on GitHub
This repository represents the official implementation of the paper titled "Context-Nav: Context-Driven Exploration and Viewpoint-Aware 3…
☆18Jun 23, 2026Updated 3 weeks ago
MasterHow / E-3DGS
View on GitHub
Pytorch implementation of the paper 'E-3DGS: Gaussian Splatting with Exposure and Motion Events'
☆23Jan 8, 2025Updated last year