Codebase for the Recognize Anything Model (RAM)
☆88Dec 11, 2023Updated 2 years ago
Alternatives and similar repositories for recognize-anything
Users that are interested in recognize-anything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2024] Tokenize Anything via Prompting☆601Dec 11, 2024Updated last year
- Open-source and strong foundation image recognition models.☆3,617Feb 18, 2025Updated last year
- Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …☆17,513Sep 5, 2024Updated last year
- Class project for COMP-781, Robotics. This is a CUDA-based collision detector for motion planning.☆13Apr 29, 2019Updated 6 years ago
- ☆17Aug 18, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An end-to-end infrared and visible light enhancement fusion algorithm based on SwinTransformer☆13Feb 14, 2025Updated last year
- [ICCV 2023] Learning Fine-Grained Features for Pixel-wise Video Correspondences☆18Mar 3, 2024Updated 2 years ago
- ☆14Feb 18, 2023Updated 3 years ago
- IGConv: Implicit Grid Convolution for Multi-Scale Image Super-Resolution☆30Nov 15, 2024Updated last year
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,823Jul 10, 2025Updated 9 months ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆23Jul 16, 2025Updated 9 months ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆12Nov 27, 2022Updated 3 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"☆10,010Aug 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [RA-L] SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization☆28Nov 24, 2025Updated 4 months ago
- [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"☆4,774Aug 19, 2024Updated last year
- Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models☆208Jan 8, 2025Updated last year
- Official implementation of RGB-D co-attention network published at PR.☆10Jan 21, 2022Updated 4 years ago
- EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything☆2,474Dec 24, 2024Updated last year
- [IEEE TMI] This is the official repository for "UniChest: Conquer-and-Divide Pre-training for Multi-Source Chest X-Ray Classification"☆19Aug 2, 2024Updated last year
- Download, browse and delete models in ComfyUI.☆12Oct 9, 2024Updated last year
- This is a repo for paper of "Sam-Based Instance Segmentation Models for the Automation of Structural Damage Detection"☆13Feb 22, 2025Updated last year
- MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer☆252Apr 3, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for the paper "Visual Recognition by Request".☆43Nov 1, 2022Updated 3 years ago
- Implements RNNPool and SoftPool for CNNs.☆14Jan 29, 2021Updated 5 years ago
- Official implementation of TagAlign☆37Dec 11, 2024Updated last year
- ☆12Jul 3, 2024Updated last year
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆10Mar 13, 2023Updated 3 years ago
- Offical implementation of CVPR 2026 paper SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving.☆59Mar 30, 2026Updated 2 weeks ago
- [AAAI2026] X-SAM: From Segment Anything to Any Segmentation☆365Apr 8, 2026Updated last week
- 3D LiDAR place recognition targeting the heterogeneous robots scenario☆31Feb 9, 2026Updated 2 months ago
- ☆14Mar 15, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- PlayStation1 MDEC compression tools☆11Dec 31, 2020Updated 5 years ago
- PDNet: Toward Better One-Stage Object Detection With Prediction Decoupling, TIP 2022☆11Nov 30, 2022Updated 3 years ago
- ☆16Mar 13, 2023Updated 3 years ago
- [CVPR 2025 Highlight] Official repository for CoMM Dataset☆53Dec 31, 2024Updated last year
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series☆1,097Jan 21, 2025Updated last year
- [CoRL2023] Open-Vocabulary Scene-Graph☆72Dec 30, 2023Updated 2 years ago
- Reproduction of the official SAF-FCOS repo.☆13Dec 4, 2023Updated 2 years ago