ziplab / CoVLinks
CoV: Chain-of-View Prompting for Spatial Reasoning
☆33Updated last week
Alternatives and similar repositories for CoV
Users that are interested in CoV are comparing it to the libraries listed below
Sorting:
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment☆36Updated 3 months ago
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head☆68Updated last week
- ☆52Updated 2 weeks ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆19Updated 2 weeks ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆56Updated 3 months ago
- This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token"☆42Updated 7 months ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆166Updated 3 months ago
- Official repo for StyleMe3D☆27Updated 8 months ago
- ☆25Updated 9 months ago
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Updated last year
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆22Updated last year
- The official implementation of The paper "Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation"☆94Updated 3 weeks ago
- Self-reimplemented version of 4D-LRM.☆65Updated 7 months ago
- Code implementation for: From Virtual Games to Real-World Play☆44Updated 6 months ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Updated 2 years ago
- Offical implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)☆81Updated 5 months ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆113Updated 3 months ago
- VideoAuteur: Towards Long Narrative Video Generation☆43Updated 2 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆31Updated 2 weeks ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆61Updated last year
- Scaling Spatial Intelligence with Multimodal Foundation Models☆156Updated last week
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆203Updated 3 months ago
- [ICCV 2025] Official implementation of "What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?"☆18Updated 5 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆42Updated 5 months ago
- 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation☆93Updated this week
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Updated 8 months ago
- ☆37Updated last week
- ☆27Updated 6 months ago
- ☆16Updated last year
- Official Code Release of NeurIPS 2025 Paper: HoloScene: Simulation‑Ready Interactive 3D Worlds from a Single Video☆82Updated 3 months ago