SHWplus / DAT_BenchmarkLinks
[NIPS 2025] Open-World Drone Active Tracking with Goal-Centered Rewards
☆10Updated last month
Alternatives and similar repositories for DAT_Benchmark
Users that are interested in DAT_Benchmark are comparing it to the libraries listed below
Sorting:
- This is the source code for Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy (ICLR20…☆43Updated last year
- An up-to-date list of progress made in next-generation AI.☆11Updated 2 years ago
- [TIP 2025] This is an official PyTorch implementation of "Zero-Shot Skeleton-Based Action Recognition With Prototype-Guided Feature Align…☆29Updated 3 months ago
- [ICCV 2025] Official PyTorch Code for "Advancing Textual Prompt Learning with Anchored Attributes"☆102Updated 2 weeks ago
- 🌟 手把手教你在论文中插入代码链接☆22Updated 2 months ago
- Instruction Tuning in Continual Learning paradigm☆62Updated 8 months ago
- Awsome of VLM-CL. Continual Learning for VLMs: A Survey and Taxonomy Beyond Forgetting☆103Updated last week
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆335Updated 2 months ago
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆284Updated 6 months ago
- [ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass☆195Updated 2 years ago
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆687Updated last month
- [CVPR 2025] Official implementation of paper "MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders".☆44Updated 4 months ago
- Code for paper "Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters" CVPR2024☆253Updated last month
- [IEEE TBD 2023] IEMask R-CNN: Information-enhanced Mask R-CNN☆16Updated 2 years ago
- [CVPR'25 Oral] LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models☆37Updated 2 months ago
- ☆14Updated 11 months ago
- A paper list of some recent works about Token Compress for Vit and VLM☆709Updated last week
- CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning☆17Updated last year
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆538Updated 3 months ago
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆373Updated 4 months ago
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…☆82Updated last year
- Unofficial code for VPT(Visual Prompt Tuning) paper of arxiv 2203.12119☆162Updated 2 years ago
- [ACL'25 Main] Official Implementation of HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Languag…☆37Updated last month
- Multimodal Large Language Model (MLLM) Tuning Survey: Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model☆83Updated 2 months ago
- The code of "Logits DeConfusion with CLIP for Few-Shot Learning" (CVPR 2025)☆59Updated 4 months ago
- [ICML2025] Test-Time Learning for Large Language Models☆27Updated 2 months ago
- [TPAMI 2025] Towards Visual Grounding: A Survey☆246Updated 2 months ago
- ☆71Updated 10 months ago
- ☆33Updated last year
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆84Updated last month