hustvl/4DLangVGGT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hustvl/4DLangVGGT)

hustvl / 4DLangVGGT

Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”

☆91

Alternatives and similar repositories for 4DLangVGGT

Users that are interested in 4DLangVGGT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LAW1223 / AlignVid
View on GitHub
☆24May 29, 2026Updated 2 months ago
KyleHuang9 / SeFAR
View on GitHub
[AAAI 2025] SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
☆30Jan 3, 2025Updated last year
EnVision-Research / ScalingAR
View on GitHub
[ICML 2026] ScalingAR: Scaling Confidence for Autoregressive Image Generation
☆22May 5, 2026Updated 2 months ago
XianfengWu01 / LightGen
View on GitHub
An Efficient Text-to-Image Generation Pretrain Pipeline
☆132Apr 18, 2025Updated last year
DuNGEOnmassster / VideoGen-of-Thought
View on GitHub
[Neurips 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minim…
☆63Sep 22, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
GeekGuru123 / ProfilingDiT
View on GitHub
☆20Jan 1, 2026Updated 6 months ago
WU-CVGL / SIU3R
View on GitHub
[NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…
☆164Sep 25, 2025Updated 10 months ago
3DAgentWorld / VGGT4D
View on GitHub
The official implementation of the paper “VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction.”
☆268Dec 2, 2025Updated 7 months ago
lifuguan / IGGT_official
View on GitHub
[ICLR'26] IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction
☆427Dec 1, 2025Updated 7 months ago
Luo-Yihang / 4RC
View on GitHub
[ICML 2026] 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere
☆215Jul 7, 2026Updated 3 weeks ago
sosppxo / mvggt
View on GitHub
[CVPR 2026] This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Ref…
☆128Mar 24, 2026Updated 4 months ago
naver / panst3r
View on GitHub
PanSt3R: Multi-view Consistent Panoptic Segmentation (official code)
☆80Mar 20, 2026Updated 4 months ago
lif314 / SmartSplat
View on GitHub
[AAAI 2026] SmartSplat: Feature-Smart Gaussians for Scalable Compression of Ultra-High-Resolution Images
☆19Dec 26, 2025Updated 7 months ago
Livioni / OmniVGGT-official
View on GitHub
[CVPR 2026 Hightlight] OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer
☆352May 21, 2026Updated 2 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
VDIGPKU / EA3D
View on GitHub
☆35Nov 17, 2025Updated 8 months ago
InternRobotics / G2VLM
View on GitHub
[CVPR 2026] G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
☆346Apr 18, 2026Updated 3 months ago
cvlab-kaist / RobustVGGT
View on GitHub
[CVPR'26] Official implementation of "Emergent Outlier View Rejection in Visual Geometry Grounded Transformers"
☆217Feb 22, 2026Updated 5 months ago
wzzheng / StreamVGGT
View on GitHub
[ICLR 2026] Streaming 4D Visual Geometry Transformer
☆945Oct 27, 2025Updated 9 months ago
LAW1223 / OpenSubject
View on GitHub
☆55Dec 10, 2025Updated 7 months ago
eldar / vdpm
View on GitHub
Official implementation of Video-DPM
☆242Jan 19, 2026Updated 6 months ago
ai-kunkun / PASA
View on GitHub
[ICML 2026] PASA: A Principled Embedding-Space Watermarking Approach for LLM-Generated Text under Semantic-Invariant Attacks
☆23May 13, 2026Updated 2 months ago
RuijieZhu94 / ObjectGS
View on GitHub
[ICCV 2025] ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
☆140Jul 16, 2026Updated last week
hustvl / Spa3R
View on GitHub
Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning
☆51Mar 25, 2026Updated 4 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
TencentARC / MotionCrafter
View on GitHub
[CVPR 2026 Highlight🔥] MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
☆176Updated this week
zrporz / 4DLangSplat
View on GitHub
Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)
☆205Oct 10, 2025Updated 9 months ago
andvg3 / eVGGT
View on GitHub
Code for "Improving Robotic Manipulation with Efficient Geometry-Aware Vision Encoder"
☆28Oct 14, 2025Updated 9 months ago
AutoLab-SAI-SJTU / InfiniteVGGT
View on GitHub
The official implementation of InfiniteVGGT
☆379Apr 19, 2026Updated 3 months ago
Jho-Yonsei / SwiftVGGT
View on GitHub
[CVPR 2026 Findings] SwiftVGGT: A Scalable Visual Geometry Grounded Transformer for Large-Scale Scenes
☆95Nov 25, 2025Updated 8 months ago
dtc111111 / Reloc-VGGT
View on GitHub
☆20Dec 25, 2025Updated 7 months ago
lck666666 / plana3r
View on GitHub
[NeurIPS 2025] the official project page of a paper, "PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splattin…
☆75May 4, 2026Updated 2 months ago
facebookresearch / 4DGT
View on GitHub
[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"
☆469Sep 19, 2025Updated 10 months ago
WenjieShu / LoopViT
View on GitHub
☆46Feb 4, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
JaceyHuang / Gen3R
View on GitHub
[CVPR 2026] Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction
☆366Mar 20, 2026Updated 4 months ago
Any-4D / Any4D
View on GitHub
Any4D: Unified Feed-Forward Metric 4D Reconstruction
☆388Apr 17, 2026Updated 3 months ago
Linketic / VGGT-X
View on GitHub
VGGT-X: When VGGT Meets Dense Novel View Synthesis
☆318Feb 4, 2026Updated 5 months ago
ywh187 / XStreamVGGT
View on GitHub
🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …
☆47Jan 27, 2026Updated 6 months ago
xiac20 / SimRecon
View on GitHub
[CVPR'26 Highlight] SimRecon: SimReady Compositional Scene Reconstruction from Real Videos
☆133Apr 14, 2026Updated 3 months ago
hustvl / TBCM
View on GitHub
Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs
☆21Dec 16, 2025Updated 7 months ago
LiZizun / WinT3R
View on GitHub
Code of WinT3R: Window-Based Streaming Rrconstruction With Camera Token Pool
☆229Mar 4, 2026Updated 4 months ago