Code for Retrieval-Augmented Perception (ICML 2025)
☆68Aug 10, 2025Updated 6 months ago
Alternatives and similar repositories for RAP
Users who are interested in RAP are comparing it to the repositories listed below
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting☆20Mar 25, 2024Updated last year
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs☆27Jan 15, 2025Updated last year
- ☆13Apr 23, 2025Updated 10 months ago
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 4 months ago
- [CVPR2025] Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models☆19Apr 30, 2025Updated 10 months ago
- Official implementation of EgoThinker at NIPS 2025☆24Nov 25, 2025Updated 3 months ago
- [EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration☆77Nov 20, 2025Updated 3 months ago
- Code for WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge☆15Dec 31, 2024Updated last year
- Expression Snippet Transformer for Robust Video-based Facial Expression Recognition☆17Jan 27, 2024Updated 2 years ago
- The code for the paper "Dual Mutual Information Constraints for Discriminative Clustering"☆23Aug 22, 2024Updated last year
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆145Jun 20, 2024Updated last year
- The official implementation for Candidate Set Re-ranking for Composed Image Retrieval (TMLR) 01/2024☆20Feb 7, 2024Updated 2 years ago
- Official code of paper "GEMeX: A Large-Scale, Groundable, and Explainable Medical VQA Benchmark for Chest X-ray Diagnosis" [ICCV 2025]☆42Jun 29, 2025Updated 8 months ago
- This is the official code of "Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation, NeurIPS 23"☆26Dec 7, 2023Updated 2 years ago
- ☆28Feb 10, 2025Updated last year
- Study notes and code for a comprehensive project practice course☆11Jun 18, 2022Updated 3 years ago
- Automated daily health check-in script for Soochow University☆13Mar 30, 2022Updated 3 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆346Apr 20, 2025Updated 10 months ago
- [NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆146Dec 26, 2024Updated last year
- [ISPRS 2024] LoveNAS: Towards Multi-Scene Land-Cover Mapping via Hierarchical Searching Adaptive Network☆33Dec 1, 2024Updated last year
- ☆33Nov 12, 2018Updated 7 years ago
- A desktop pet program that now seems to have evolved into a desktop sticky-notes app. See the develop-todolist branch for the sticky-notes program.☆11Nov 17, 2024Updated last year
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆44Sep 24, 2024Updated last year
- A Google Chrome Extension that replaces the official New Tab page with a beautiful to-do list.☆12Mar 7, 2018Updated 7 years ago
- This repository contains the implementation for our work "TopoDiffusionNet: A Topology-aware Diffusion Model", accepted to ICLR 2025.☆21Apr 17, 2025Updated 10 months ago
- Official Implementation of DART (DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference).☆44Feb 8, 2026Updated 3 weeks ago
- ☆11Nov 20, 2024Updated last year
- ☆12Jan 15, 2015Updated 11 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆14Jul 31, 2025Updated 7 months ago
- ☆10Nov 17, 2022Updated 3 years ago
- Disable YubiKey output on MacOS without a modifier key pressed☆10Aug 10, 2022Updated 3 years ago
- ECG analysis to classify anterior myocardial infarction cases.☆10May 17, 2017Updated 8 years ago
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆44Sep 12, 2024Updated last year
- [MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.☆11Sep 24, 2024Updated last year
- [WMT 2022 champion system] Vega-MT model and inference scripts☆41Feb 10, 2023Updated 3 years ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆103Jan 30, 2024Updated 2 years ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆205Feb 6, 2026Updated 3 weeks ago