ziplab / CoV
CoV: Chain-of-View Prompting for Spatial Reasoning
☆50 · Updated 2 weeks ago
Alternatives and similar repositories for CoV
Users who are interested in CoV are comparing it to the repositories listed below.
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment ☆36 · Updated 4 months ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning ☆57 · Updated 3 months ago
- ☆63 · Updated last month
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models ☆33 · Updated last month
- This repository contains the code for the paper "Aligning Text, Images, and 3D Structure Token-by-Token" ☆42 · Updated 8 months ago
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026) ☆114 · Updated 2 weeks ago
- Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models ☆167 · Updated 4 months ago
- ☆25 · Updated 10 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control ☆47 · Updated 2 months ago
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image ☆55 · Updated last year
- The official implementation of the paper "Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation" ☆95 · Updated last month
- Official repo for StyleMe3D ☆28 · Updated 9 months ago
- Official Repo for Fast-SAM3D: 3Dfy Anything in Images but Faster ☆40 · Updated this week
- Code implementation for: From Virtual Games to Real-World Play ☆45 · Updated 7 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give… ☆207 · Updated 3 months ago
- VideoAuteur: Towards Long Narrative Video Generation ☆43 · Updated 3 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image" ☆10 · Updated 2 years ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ☆43 · Updated 6 months ago
- ☆21 · Updated 2 months ago
- ☆27 · Updated 6 months ago
- Official implementation of "Auto-Regressively Generating Multi-View Consistent Images" (ICCV 2025) ☆82 · Updated 6 months ago
- Implementation of paper "SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model" ☆76 · Updated last month
- Self-reimplemented version of 4D-LRM. ☆65 · Updated 8 months ago
- Scaling Spatial Intelligence with Multimodal Foundation Models ☆170 · Updated this week
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis ☆62 · Updated 9 months ago
- [ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation ☆98 · Updated 3 weeks ago
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right" ☆18 · Updated 8 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) ☆86 · Updated 11 months ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation ☆116 · Updated 4 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation ☆25 · Updated 10 months ago