harrytea / ROOT
ROOT: VLM based System for Indoor Scene Understanding and Beyond
☆20Updated last week
Alternatives and similar repositories for ROOT:
Users that are interested in ROOT are comparing it to the libraries listed below
- ☆13Updated 10 months ago
- Sora Generates Videos with Stunning Geometrical Consistency☆47Updated 10 months ago
- ☆38Updated last year
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples☆46Updated 3 months ago
- Scaling Properties of Diffusion Models For Perceptual Tasks☆35Updated 2 months ago
- ☆22Updated last month
- ☆20Updated 3 weeks ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆32Updated 7 months ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆92Updated 6 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆20Updated 8 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆74Updated 5 months ago
- ☆58Updated last year
- Open implementation of "RandAR"☆51Updated 2 weeks ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆15Updated 2 months ago
- ☆23Updated 6 months ago
- ☆62Updated last year
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆40Updated 2 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆36Updated 10 months ago
- Collection of Highlight papers☆26Updated 8 months ago
- Open-Vocabulary Panoptic Segmentation☆21Updated 4 months ago
- Liquid: Language Models are Scalable Multi-modal Generators☆61Updated last month
- ☆26Updated last month
- Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning☆109Updated 5 months ago
- ICCV2023-Diffusion-Papers☆109Updated last year
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆67Updated 4 months ago
- A collection of vision foundation models unifying understanding and generation.☆40Updated 3 weeks ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆20Updated 3 months ago
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'☆53Updated last month
- ☆34Updated 9 months ago