Luodian / RelateAnything

The Relate Anything Model takes an image as input and uses SAM to identify the corresponding mask within the image.

☆448

Alternatives and similar repositories for RelateAnything:

Users who are interested in RelateAnything are comparing it to the repositories listed below.

Saiyan-World / grounded-segment-any-parts
Grounded Segment Anything: From Objects to Parts
☆393 Updated last year
showlab / Image2Paragraph
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
☆800 Updated last year
Vision-CAIR / ChatCaptioner
Official Repository of ChatCaptioner
☆460 Updated last year
OptimalScale / DetGPT
☆762 Updated 5 months ago
JialianW / GRiT
GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)
☆310 Updated last year
baaivision / tokenize-anything
[ECCV 2024] Tokenize Anything via Prompting
☆557 Updated last month
facebookresearch / VLPart
[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation
☆362 Updated last year
OpenGVLab / all-seeing
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …
☆472 Updated 5 months ago
jshilong / GPT4RoI
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
☆521 Updated 7 months ago
IDEA-Research / OpenSeeD
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
☆679 Updated 11 months ago
shikras / shikra
☆756 Updated 6 months ago
facebookresearch / ov-seg
This is the official PyTorch implementation of the paper Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP.
☆704 Updated last year
microsoft / X-Decoder
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
☆1,297 Updated last year
Jingkang50 / OpenPSG
Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22
☆433 Updated last year
ngthanhtin / owlvit_segment_anything
Combining OwlViT with Segment Anything - Open-vocabulary Detection and Segmentation (Text-conditioned, and Image-conditioned)
☆158 Updated last year
NVlabs / ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
☆876 Updated 6 months ago
FoundationVision / UniRef
[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces
☆234 Updated last year
berkeley-hipie / HIPIE
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
☆276 Updated 9 months ago
YuchenLiu98 / COMM
PyTorch code for the paper "From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models"
☆191 Updated last week
xk-huang / segment-caption-anything
[CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA), links for downloadin…
☆209 Updated 3 months ago
JerryX1110 / awesome-segment-anything-extensions
Segment-anything related awesome extensions/projects/repos.
☆343 Updated last year
RockeyCoss / Prompt-Segment-Anything
This is an implementation of zero-shot instance segmentation using Segment Anything.
☆306 Updated last year
cientgu / InstructDiffusion
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
☆405 Updated 8 months ago
SunzeY / AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
☆765 Updated 5 months ago
maxi-w / CLIP-SAM
Experiment on combining CLIP with SAM to do open-vocabulary image segmentation.
☆352 Updated last year
RunpeiDong / DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
☆409 Updated last month
LLaVA-VL / LLaVA-Interactive-Demo
LLaVA-Interactive-Demo
☆360 Updated 5 months ago
ContextualAI / lens
This is the official repository for the LENS (Large Language Models Enhanced to See) system.
☆351 Updated last year
shenyunhang / APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
☆502 Updated 8 months ago
allenai / visprog
Official code for VisProg (CVPR 2023 Best Paper!)
☆701 Updated 4 months ago