Code Implementation of "WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation"
☆107Mar 18, 2026Updated this week
Alternatives and similar repositories for WorldCam
Users that are interested in WorldCam are comparing it to the libraries listed below
Sorting:
- Geometry-Aware Score Distillation via 3D Consistent Noising and Gradient Consistency Modeling☆29Sep 17, 2024Updated last year
- Wasserstein Gaussian Splatting☆17Dec 10, 2024Updated last year
- Official implementation of "AnthroTAP: Learning Point Tracking with Real-World Motion"☆25Mar 9, 2026Updated last week
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆46Nov 12, 2025Updated 4 months ago
- SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation☆11Jul 31, 2024Updated last year
- Official implementation of "Seurat: From Moving Points to Depth", CVPR 2025 Highlight☆68Apr 9, 2025Updated 11 months ago
- Official implementation of "CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models"☆42Feb 24, 2026Updated 3 weeks ago
- ☆12Mar 17, 2024Updated 2 years ago
- Pytorch implementation of: "Continual Semantic Segmentation via Structure Preserving and Projected Feature Alignment", ECCV22☆11Jul 22, 2022Updated 3 years ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels☆186Dec 25, 2025Updated 2 months ago
- Flash Sculptor: Modular 3D Worlds from Objects☆33Apr 13, 2025Updated 11 months ago
- [TMLR 2025] Monocular Dynamic Gaussian Splatting: Fast, Brittle, and Scene Complexity Rules☆28Aug 1, 2025Updated 7 months ago
- ☆23Jul 22, 2025Updated 7 months ago
- Official implementation of "MV-TAP: Tracking Any Point in Multi-View Videos"☆39Mar 10, 2026Updated last week
- Official implementation of "Unified Diffusion Transformer for High-Fidelity Text-Aware Image Restoration"☆27Dec 22, 2025Updated 2 months ago
- LatentMorph: Morphing Latent Reasoning into Image Generation☆37Feb 3, 2026Updated last month
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆48Feb 10, 2026Updated last month
- Tracking the latest and greatest research papers on diffusion large language models.☆23Mar 13, 2026Updated last week
- [CVPR 2024] VidToMe: Video Token Merging for Zero-Shot Video Editing☆20Feb 29, 2024Updated 2 years ago
- Official implementation of "Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression"☆92Feb 8, 2026Updated last month
- Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]☆317Mar 9, 2026Updated last week
- (CVPR Workshop Best Paper Award) Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustn…☆17Nov 4, 2025Updated 4 months ago
- ☆124Jun 17, 2025Updated 9 months ago
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆341Feb 21, 2026Updated 3 weeks ago
- Code release of "Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion".☆106Oct 16, 2025Updated 5 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 7 months ago
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆182Mar 10, 2026Updated last week
- ☆91Nov 17, 2025Updated 4 months ago
- [ICLR 2024] "3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining"☆12Aug 25, 2024Updated last year
- ☆13Dec 17, 2022Updated 3 years ago
- Long-range camera-conditioned scene generation from one single image.☆107Dec 23, 2025Updated 2 months ago
- Official repository for SuperCATs : Cost Aggregation with Transformers for Sparse Correspondence (ICCE-Asia'22)☆18Dec 31, 2022Updated 3 years ago
- Official Implementation of "Multi-Granularity Video Object Segmentation" (AAAI 2025)☆25Dec 20, 2024Updated last year
- open-sourced video dataset with dynamic scenes and camera movements annotation☆87Apr 24, 2025Updated 10 months ago
- World Modeling by Forecasting Vision Foundation Model Features☆37Jan 7, 2026Updated 2 months ago
- [CVPR 2026] DDT: Decoupled Diffusion Transformer☆373Aug 22, 2025Updated 6 months ago
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models☆19Mar 25, 2025Updated 11 months ago
- a open-source Self-Reimplemented Version of the paper "RayZer: A Self-supervised Large View Synthesis Model"☆30Jan 4, 2026Updated 2 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆32Jun 12, 2025Updated 9 months ago