PromptExpert / blogs
☆53Updated this week
Related projects
Alternatives and complementary repositories for blogs
- Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models☆136Updated last week
- Efficient Multimodal Large Language Models: A Survey☆269Updated 2 months ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆286Updated 2 months ago
- Mainly records multimodal knowledge for large language model (LLM) algorithm (application) engineers☆81Updated 6 months ago
- [CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".☆224Updated 4 months ago
- [CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding☆205Updated last month
- [CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding☆284Updated 5 months ago
- ☆214Updated 7 months ago
- 📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).☆445Updated last month
- This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…☆46Updated 3 months ago
- [NeurIPS 2023 Datasets and Benchmarks Track] LAMM: Multi-Modal Large Language Models and Applications as AI Agents☆300Updated 6 months ago
- A paper list of some recent works about Token Compress for ViT and VLM☆130Updated this week
- This is the official repository for Retrieval Augmented Visual Question Answering☆181Updated 2 months ago
- 🔥🔥MLVU: Multi-task Long Video Understanding Benchmark☆156Updated last week
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆230Updated 2 months ago
- ☆287Updated 9 months ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆55Updated 2 months ago
- 😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.☆144Updated 7 months ago
- Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Mod…☆266Updated 2 months ago
- Visual Instruction Tuning for Qwen2 Base Model☆19Updated 4 months ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆178Updated 7 months ago
- Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"☆163Updated 2 months ago
- HallE-Control: Controlling Object Hallucination in LMMs☆28Updated 7 months ago
- Awesome papers & datasets specifically focused on long-term videos.☆194Updated 3 weeks ago
- ✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis☆402Updated 4 months ago
- [MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models☆280Updated 2 weeks ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal …☆27Updated last week
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.☆205Updated this week
- A RLHF Infrastructure for Vision-Language Models☆98Updated 5 months ago
- Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model☆244Updated 4 months ago