Code for Retrieval-Augmented Perception (ICML 2025)
☆69Apr 22, 2026Updated last month
Alternatives and similar repositories for RAP
Users that are interested in RAP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- 🚀enhanced GRPO with more verifiable rewards and real-time evaluators☆37Jan 27, 2026Updated 3 months ago
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting☆21Mar 25, 2024Updated 2 years ago
- [EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration☆80Nov 20, 2025Updated 6 months ago
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official repo for [NeurlPS 2025 Spotlight] "GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution"☆50Oct 27, 2025Updated 6 months ago
- Official repo for ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models☆28Mar 24, 2025Updated last year
- A vision-language model with bidirectional progressive fusion and global-local alignment for enhanced medical image segmentation.☆19Dec 25, 2025Updated 5 months ago
- ☆51May 7, 2026Updated 2 weeks ago
- ☆22Sep 23, 2025Updated 8 months ago
- [CVPR2025] Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models☆21Apr 30, 2025Updated last year
- ☆13Apr 23, 2025Updated last year
- [MICCAI 2025] Bridging the Gap in Missing Modalities: Leveraging Knowledge Distillation and Style Matching for Brain Tumor Segmentation☆21Jul 13, 2025Updated 10 months ago
- (TGRS 2024) OrientedFormer: An End-to-End Transformer-Based Oriented Object Detector in Remote Sensing Images☆48Jul 14, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆143Jun 20, 2024Updated last year
- This is the official repository for paper: cross-modal information flow in multimodal large language models☆44May 21, 2025Updated last year
- ☆31Feb 10, 2025Updated last year
- Creating High-Fidelity Synthetic GPS Trajectory Dataset for Urban Mobility Analysis☆22Mar 12, 2026Updated 2 months ago
- ☆24Jun 18, 2025Updated 11 months ago
- Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning☆44Jul 2, 2025Updated 10 months ago
- Towards Robust Multimodal Sentiment Analysis with Incomplete Data☆114Feb 24, 2026Updated 3 months ago
- [ACM TOMM] Official implementation of "TextCoT: Zoom-In for Enhanced Multimodal Text-Rich Image Understanding"☆45Feb 27, 2026Updated 2 months ago
- Official Implementation of "IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models"☆17Jun 5, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆373Apr 20, 2025Updated last year
- ☆24Nov 29, 2024Updated last year
- Android malware classification using both .java files and .so files☆11Jan 19, 2019Updated 7 years ago
- Official implementation of EgoThinker at NIPS 2025☆28Nov 25, 2025Updated 6 months ago
- Evaluation of ML models in Android malware classification, adversarial attacks on DNNs & defense mechanisms☆13Jan 14, 2020Updated 6 years ago
- ☆13Nov 2, 2025Updated 6 months ago
- [ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"☆38Jul 12, 2024Updated last year
- This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23"☆27Dec 7, 2023Updated 2 years ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆26Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆22Jun 12, 2025Updated 11 months ago
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".☆24Jun 9, 2025Updated 11 months ago
- Pytorch implementation for codes in Noise Imitation Based Adversarial Training for Robust Multimodal Sentiment Analysis (Accepted by IEEE…☆15Feb 2, 2024Updated 2 years ago
- Working note for WSI analysis☆10Apr 3, 2023Updated 3 years ago
- 苏州大学每日健康情况自动化打卡脚本☆13Mar 30, 2022Updated 4 years ago
- 综合项目实践项目学习记录+代码☆11Jun 18, 2022Updated 3 years ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆55Feb 1, 2024Updated 2 years ago