TencentARC / SGAT4PASSLinks
[IJCAI 2023] official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation
☆31Updated 2 years ago
Alternatives and similar repositories for SGAT4PASS
Users that are interested in SGAT4PASS are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆78Updated last year
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' (ICCV 2025)☆73Updated 2 weeks ago
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆38Updated 5 months ago
- [ACM MM2024] Official implementation of the paper "GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space …☆66Updated 9 months ago
- The official repository for paper "MLLMs Need 3D-Aware Representation Supervision for Scene Understanding"☆79Updated 2 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆202Updated 6 months ago
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆53Updated 2 weeks ago
- [ICCV 2023] CLIP2Point: Transfer CLIP to Point Cloud Classification with Image-Depth Pre-training☆120Updated last year
- This is the official implementation of VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Mode…☆16Updated 5 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆46Updated 3 weeks ago
- [Arxiv 25'] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆38Updated last month
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆87Updated 4 months ago
- ☆95Updated 4 months ago
- EditAR: Unified Conditional Generation with Autoregressive Models (CVPR 2025)☆24Updated 2 months ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆18Updated 2 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆58Updated last year
- Official repository for paper "Open Panoramic Segmentation" (OPS), ECCV 2024☆30Updated 3 months ago
- Official Release of ICCV 2025 paper -- DiscretizedSDF☆82Updated 2 weeks ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆18Updated 7 months ago
- ☆26Updated 4 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆82Updated 8 months ago
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Updated last year
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight]☆48Updated last year
- [ECCV2024] Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding☆123Updated last year
- [ICML2025 Oral] ReferSplat: Referring Segmentation in 3D Gaussian Splatting☆30Updated last week
- [NeurIPS 2024] OpenDlign: Enhancing Open-World 3D Learning with Depth-Aligned Images☆28Updated 10 months ago
- ☆100Updated 4 months ago
- ☆75Updated 2 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆54Updated last month
- VideoDirector [CVPR 2025]☆25Updated 4 months ago