OPPOMKLab / recognize-anything
Codebase for the Recognize Anything Model (RAM)
☆78Updated last year
Alternatives and similar repositories for recognize-anything:
Users that are interested in recognize-anything are comparing it to the libraries listed below
- YOLO-World + EfficientViT SAM☆97Updated last year
- SSA + FastSAM/Semantic Fast Segment Anything , or Fast Semantic Segment Anything☆98Updated last year
- ☆180Updated 3 weeks ago
- This repository is for the first survey on SAM & SAM2 for Videos.☆47Updated last week
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆240Updated 2 months ago
- [CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloadin…☆222Updated 7 months ago
- ☆105Updated 10 months ago
- Combining "segment-anything" with MOT, it create the era of "MOTS"☆154Updated last year
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆95Updated 9 months ago
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆130Updated last year
- ☆27Updated 6 months ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆402Updated last month
- Recognize Any Regions☆122Updated 4 months ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆90Updated 6 months ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation☆122Updated 6 months ago
- [ECCV 2024] Tokenize Anything via Prompting☆583Updated 4 months ago
- Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.☆18Updated 3 years ago
- Image Editing Anything☆114Updated 2 years ago
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆117Updated last year
- Scaling Vision Pre-Training to 4K Resolution☆154Updated last week
- VimTS: A Unified Video and Image Text Spotter☆77Updated 5 months ago
- using clip and sam to segment any instance you specify with text prompt of any instance names☆174Updated last year
- ☆91Updated 9 months ago
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆244Updated 6 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆50Updated 4 months ago
- Precision Search through Multi-Style Inputs☆69Updated 2 weeks ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆243Updated 3 weeks ago
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…☆314Updated last year
- ☆65Updated last year
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆124Updated 8 months ago