chen-wl20 / GenWorldLinks
GenWorld: Towards Detecting AI-generated Real-world Simulation Videos
☆32Updated 4 months ago
Alternatives and similar repositories for GenWorld
Users that are interested in GenWorld are comparing it to the libraries listed below
Sorting:
- SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis☆36Updated 4 months ago
- [CVPR 2025] UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting☆47Updated last month
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆48Updated last month
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆54Updated 2 months ago
- [NeurIPS 2025]《SD-VLM: Spatial Measuring and Understanding with Depth-encoded Vision Language Models》☆24Updated 3 weeks ago
- Project Page for GaussianFormer☆24Updated last year
- ☆111Updated 4 months ago
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆134Updated last month
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆21Updated 5 months ago
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction☆111Updated 3 months ago
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆61Updated last month
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆23Updated 7 months ago
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆65Updated 4 months ago
- ☆34Updated last year
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆41Updated 3 months ago
- Code for Faster VGGT with Block-Sparse Global Attention☆83Updated this week
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆133Updated 4 months ago
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆36Updated 4 months ago
- ☆25Updated 5 months ago
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) …☆31Updated 2 weeks ago
- [ICLR 2024] This is the official implementation of our paper "Semantic Flow: Learning Semantic Fields of Dynamic Scenes from Monocular Vi…☆11Updated last year
- [CVPR2025] Code Release for "FlexGS: Train Once, Deploy Everywhere with Many-in-One Flexible 3D Gaussian Splatting"☆43Updated 4 months ago
- BloomScene: Lightweight Structured 3D Gaussian Splatting for Crossmodal Scene Generation (AAAI 2025)☆17Updated 9 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆289Updated 2 months ago
- open-sourced video dataset with dynamic scenes and camera movements annotation☆78Updated 6 months ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆52Updated 3 weeks ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆151Updated last month
- ☆47Updated 4 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight)☆115Updated 7 months ago
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆41Updated 3 weeks ago