KAIST-Visual-AI-Group / VG-AVS
Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection
☆18 · Updated last week
Alternatives and similar repositories for VG-AVS
Users interested in VG-AVS are comparing it to the repositories listed below.
- Official PyTorch implementation of CorrespondentDream: Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences (CVPR 2024 Po… ☆19 · Updated last year
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat… ☆19 · Updated 7 months ago
- ☆47 · Updated 2 years ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering" ☆49 · Updated 10 months ago
- Unofficial implementation of E-LatentLPIPS in Diffusion2GAN ☆19 · Updated last year
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation ☆30 · Updated 7 months ago
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation ☆33 · Updated 3 months ago
- [NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models" ☆93 · Updated last month
- [CVPR 2025] GPS as a Control Signal for Image Generation ☆25 · Updated 10 months ago
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo… ☆23 · Updated 7 months ago
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025) ☆30 · Updated 3 months ago
- ☆15 · Updated 4 months ago
- [Official Implementation] Improving Editability in Image Generation with Layer-wise Memory, CVPR 2025 ☆36 · Updated 4 months ago
- ☆25 · Updated 11 months ago
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation" ☆82 · Updated 6 months ago
- Training recipe for SpatialReasoner ☆33 · Updated 4 months ago
- [ICCV'25] Official PyTorch Implementation of "JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers" ☆27 · Updated 2 months ago
- Official repository for the paper "Instance-Wise Holistic Order Prediction in Natural Scenes". ☆26 · Updated 2 years ago
- [NeurIPS'25] Official implementation of "D^2USt3R: Enhancing 3D Reconstruction with 4D Pointmaps for Dynamic Scenes" ☆68 · Updated last month
- Implementation for DIY-SC paper. ☆22 · Updated 6 months ago
- [ECCV2024] Official Implementation of "NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image" ☆32 · Updated last year
- An official implementation of RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videos ☆37 · Updated last year
- Official code implementation of "DäRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation" (NeurIPS 2023) ☆70 · Updated last year
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction ☆43 · Updated 5 months ago
- An open source Multi-View Latent Diffusion Model ☆41 · Updated 8 months ago
- [ICCV 2025] Official implementation of "What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?" ☆18 · Updated 5 months ago
- Official implementation of PartSTAD: 2D-to-3D Part Segmentation Task Adaptation (ECCV 2024). ☆53 · Updated last year
- ☆38 · Updated 3 weeks ago
- ☆19 · Updated 3 months ago
- [CVPR2025] Official repository for "VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide" ☆28 · Updated 8 months ago