TencentARC / SGAT4PASS
This is the official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation (IJCAI 2023)
☆30Updated last year
Alternatives and similar repositories for SGAT4PASS
Users that are interested in SGAT4PASS are comparing it to the libraries listed below
Sorting:
- This is the official implementation of VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Mode…☆13Updated 2 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆76Updated last year
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆56Updated last year
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight]☆44Updated 9 months ago
- Official repository for paper "Open Panoramic Segmentation" (OPS), ECCV 2024☆27Updated 2 weeks ago
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction'☆30Updated last week
- [ICLR 2025] Layout-Your-3D: Controllable and Precise 3D Generation with 2D Blueprint☆11Updated 3 months ago
- ☆33Updated 7 months ago
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'☆56Updated 4 months ago
- ☆60Updated 3 months ago
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control☆77Updated 5 months ago
- ☆20Updated last month
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆39Updated last year
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆35Updated last week
- [NeurIPS2024] DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion☆35Updated 7 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆40Updated last year
- An unofficial implementation of DreamScene360.☆82Updated 11 months ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆18Updated 4 months ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆52Updated last month
- A collection of vision foundation models unifying understanding and generation.☆55Updated 4 months ago
- Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆91Updated 4 months ago
- (CVPR 2024) NViST: In the wild New View Synthesis from a Single Image with Transformers☆40Updated 7 months ago
- ☆59Updated last month
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆31Updated 2 months ago
- Official implementation of "WorDepth: Variational Language Prior for Monocular Depth Estimation"☆37Updated 3 months ago
- Repository of Trans4PASS (accepted to CVPR2022)☆91Updated 2 years ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆40Updated last year
- LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model (CVPR2024)☆76Updated 9 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆82Updated 2 weeks ago
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆78Updated this week