Mr-Loevan / HSA-DPOLinks
[AAAI 2025] Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback
☆27Updated 5 months ago
Alternatives and similar repositories for HSA-DPO
Users that are interested in HSA-DPO are comparing it to the libraries listed below
Sorting:
- [AAAI2025] Offical code implementation of "Context-aware Inductive Knowledge Graph Completion with Latent Type Constraints and Subgraph R…☆17Updated last month
- [AAAI2025] FedCFA: Alleviating Simpson’s Paradox in Model Aggregation with Counterfactual Federated Learning☆26Updated 5 months ago
- [AAAI-2025] Official Codes for “Motif Guided Graph Transformer with Combinatorial Skeleton Prototype Learning for Skeleton-Based Person R…☆19Updated 4 months ago
- This is the official code implement for AAAI 2025 paper ``Defeasible Visual Entailment: Benchmark, Evaluator, and Reward-Driven Optimizat…☆23Updated 3 months ago
- Source code for AAAI'25 paper "Component-Level Segmentation for Oracle Bone Inscription Decipherment"☆17Updated last month
- 【CVPR2025】IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification☆29Updated 3 months ago
- This repo is the official implementation of "Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning" accepted by AA…☆43Updated last month
- 【AAAI2025】DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification☆55Updated 4 months ago
- ✨ [AAAI 2025] Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification☆44Updated 3 months ago
- AAAI '25. Retrieval-Augmented Multimodal Social Media Popularity Prediction☆19Updated 2 months ago
- An official implementation of "Re-Attentional Controllable Video Diffusion Editing" in PyTorch. (AAAI 2025)☆26Updated 7 months ago
- [AAAI 2025] Official implementation of the paper "Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segm…☆35Updated 7 months ago
- AAAI 2025: Hierarchical Consensus Network for Multiview Feature Learning☆19Updated 5 months ago
- 【AAAI2025】MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt☆75Updated 2 months ago
- [AAAI 2025] CoPEFT: Fast Adaptation Framework for Multi-Agent Collaborative Perception with Parameter-Efficient Fine-Tuning☆22Updated 3 months ago
- [AAAI 2025] Motion Prior Knowledge Learning with Homogeneous Language Descriptions for Moving Infrared Small Target Detection☆29Updated last month
- [AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"☆28Updated 7 months ago
- [AAAI 2025] PAT: Pruning-Aware Tuning for Large Language Models☆33Updated 5 months ago
- [AAAI'2025] The official implementation code of SIGMA☆31Updated 4 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆64Updated 3 weeks ago
- Pytorch Implementation of LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-Identification☆33Updated last week
- [AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story…☆34Updated last month
- TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning☆83Updated last month
- [CVPR 2025] Official PyTorch Code for "MMRL: Multi-Modal Representation Learning for Vision-Language Models" and its extension "MMRL++: P…☆57Updated 3 weeks ago
- [CVPR2025] FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression☆45Updated 4 months ago
- [CVPR2025] Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models☆12Updated 2 months ago
- ☆66Updated 2 months ago
- ☆12Updated 5 months ago
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models☆87Updated 2 weeks ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆14Updated 7 months ago