showlab / LOVA3
(NeurIPS 2024) Learning to Visual Question Answering, Asking and Assessment
☆63Updated this week
Related projects ⓘ
Alternatives and complementary repositories for LOVA3
- An Information Flow Perspective for Exploring Large Vision Language Models on Reasoning Tasks☆58Updated 2 weeks ago
- Weakly supverised individual counting☆29Updated 3 months ago
- ☆32Updated 3 months ago
- ☆43Updated last year
- [NeurIPS'24] Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation☆65Updated last month
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆49Updated last month
- ☆42Updated 9 months ago
- Domain Prompt Learning with Quaternion Networks (CVPR2024 Highlight)☆107Updated last month
- ☆42Updated 3 months ago
- Domain-Controlled Prompt Learning (AAAI2024)☆113Updated 10 months ago
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆48Updated 3 months ago
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆38Updated 6 months ago
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆27Updated 11 months ago
- MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.☆44Updated last year
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆126Updated 3 months ago
- ☆84Updated 2 weeks ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆42Updated 2 months ago
- A PyTorch implementation for Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks☆51Updated 4 years ago
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆214Updated 3 weeks ago
- For paper "AgileGAN: Stylizing Portraits by Inversion-Consistent Transfer Learning"☆48Updated 2 years ago
- 🚀 [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'2…☆62Updated this week
- ☆55Updated last year
- ☆104Updated last week
- Official implementation of "Generating images with 3D annotations using diffusion models".☆58Updated 2 months ago
- NWPU足基 ATOM_LINKER 唐天扬负责 硬件组☆52Updated 2 years ago
- LoRA fine-tuning Mistral-7b-v2 on PR Task☆28Updated 3 months ago
- Implementation of RSGC-BD (Blur Detection)☆58Updated 2 months ago
- linkedin, seek job information crawler☆105Updated 3 weeks ago
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆125Updated last week