NK-CS-ZZL / GS-RORLinks
Official Release of ACM TOG 2025 paper -- GS-ROR
☆31Updated 2 months ago
Alternatives and similar repositories for GS-ROR
Users that are interested in GS-ROR are comparing it to the libraries listed below
Sorting:
- Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)☆14Updated 5 months ago
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction' (ICCV 2025)☆75Updated 3 months ago
- ☆51Updated last month
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆57Updated 3 months ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆30Updated 6 months ago
- [ICCV 2025] Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment☆35Updated 2 weeks ago
- An open source codebase for object detection based on Jittor☆19Updated 8 months ago
- Official Release of ICCV 2025 paper -- DiscretizedSDF☆93Updated 2 months ago
- The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs☆110Updated 3 months ago
- Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"☆72Updated last month
- ☆11Updated 10 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆42Updated 7 months ago
- Exploring Feature Self-relation for Self-supervised Transformer (TPAMI 2023)☆21Updated 6 months ago
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆110Updated last week
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆44Updated 6 months ago
- Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆56Updated 5 months ago
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆48Updated last month
- ☆16Updated 4 months ago
- [NeruIPS 25] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆50Updated this week
- The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".☆25Updated last month
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆69Updated 4 months ago
- A collection of vision foundation models unifying understanding and generation.☆57Updated 9 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆54Updated last year
- [CVPR 2025 (Oral)] Open implementation of "RandAR"☆197Updated 3 months ago
- [SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation☆55Updated 2 weeks ago
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆22Updated 9 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆177Updated 5 months ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆59Updated 3 months ago
- [ICML2025 Oral] ReferSplat: Referring Segmentation in 3D Gaussian Splatting☆116Updated last month
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆36Updated 7 months ago