TencentARC / SGAT4PASSLinks
[IJCAI 2023] official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation
☆33Updated 2 years ago
Alternatives and similar repositories for SGAT4PASS
Users that are interested in SGAT4PASS are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆80Updated last year
- ☆113Updated 3 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆60Updated last year
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆38Updated last month
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆58Updated 4 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆82Updated 11 months ago
- Official repository for paper "Open Panoramic Segmentation" (OPS), ECCV 2024☆32Updated last month
- EditAR: Unified Conditional Generation with Autoregressive Models (CVPR 2025)☆36Updated 5 months ago
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' (ICCV 2025)☆75Updated last week
- This is the official implementation of VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Mode…☆16Updated 8 months ago
- Official implementation of "Diffusion Model for Dense Matching" (ICLR'24 Oral)☆184Updated last year
- [NeurIPS DB 2025] IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering☆41Updated last month
- [ICML2025 Oral] ReferSplat: Referring Segmentation in 3D Gaussian Splatting☆120Updated 2 months ago
- Official implementation of "Can Language Understand Depth?"☆82Updated 3 years ago
- Offical repo for ICCV25 Highlight Paper: "ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric…☆51Updated last month
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆213Updated 9 months ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆18Updated 10 months ago
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆55Updated 6 months ago
- Official PyTorch Implementation for Diffusion Hyperfeatures, NeurIPS 2023☆109Updated last year
- [WACV 2025, Best Student Paper, Oral] GeoDiffuser: Geometry-Based Image Editing with Diffusion Models☆20Updated 7 months ago
- [NeurIPS 2025] MLLMs Need 3D-Aware Representation Supervision for Scene Understanding☆113Updated 2 weeks ago
- [CVPR'2025] EntitySAM: Segment Everything in Video☆52Updated 4 months ago
- [ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabular…☆141Updated last week
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆30Updated last year
- ☆80Updated 5 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆49Updated last month
- [ICLR 25'] InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting☆21Updated 7 months ago
- [WACV 2025] DistillDIFT: Distillation of Diffusion Features for Semantic Correspondence☆33Updated 4 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆45Updated last year
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆52Updated 3 months ago