Arking1995 / COHOLinks
[ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation
☆10Updated last year
Alternatives and similar repositories for COHO
Users that are interested in COHO are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.☆101Updated last year
- [CVPR 2025] Open-World Amodal Appearance Completion☆41Updated 2 weeks ago
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆59Updated 2 weeks ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆53Updated this week
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' (ICCV 2025)☆75Updated 2 weeks ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆107Updated 5 months ago
- Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025☆13Updated last month
- ☆17Updated last month
- Visual Spatial Tuning☆133Updated last week
- [NeurIPS 2025] MLLMs Need 3D-Aware Representation Supervision for Scene Understanding☆115Updated 2 weeks ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆30Updated last year
- [CVPR 2025] GPS as a Control Signal for Image Generation☆24Updated 8 months ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆62Updated 4 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆56Updated 3 months ago
- Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)☆25Updated 3 weeks ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆42Updated 11 months ago
- Self-reimplemented version of 4D-LRM.☆62Updated 5 months ago
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆120Updated last week
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆34Updated 5 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆25Updated 5 months ago
- ☆37Updated last year
- OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models☆72Updated last month
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Updated last year
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆53Updated last month
- Official implementation of PARIS3D (Accepted to ECCV 2024).☆27Updated last year
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated last year
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆59Updated 4 months ago
- [CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation"☆19Updated 7 months ago
- ☆60Updated 7 months ago
- [NeurIPS 2024 Oral] RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation☆18Updated 11 months ago