HiThink-Research / MME-Finance
☆20Updated last week
Related projects ⓘ
Alternatives and complementary repositories for MME-Finance
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆29Updated 7 months ago
- A Survey on the Honesty of Large Language Models☆44Updated last month
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆79Updated 9 months ago
- ☆84Updated 10 months ago
- DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆15Updated 3 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆23Updated last month
- ☆37Updated 5 months ago
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Infe…☆76Updated this week
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆45Updated 2 months ago
- my commonly-used tools☆47Updated 3 months ago
- ☆115Updated 3 months ago
- Official code for our paper "Model Composition for Multimodal Large Language Models"☆17Updated 6 months ago
- A RLHF Infrastructure for Vision-Language Models☆98Updated 5 months ago
- Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models☆136Updated last week
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆31Updated 2 weeks ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆153Updated 9 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆26Updated 4 months ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆48Updated last year
- This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vi…☆81Updated 3 weeks ago
- Attaching human-like eyes to the large language model. The codes of IEEE TMM paper "LMEye: An Interactive Perception Network for Large La…☆48Updated 3 months ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal …☆27Updated last week
- [ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"☆52Updated last year
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"☆14Updated 5 months ago
- ☆71Updated 10 months ago
- An benchmark for evaluating the capabilities of large vision-language models (LVLMs)☆33Updated 11 months ago
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)☆18Updated 3 weeks ago
- ☆147Updated 4 months ago
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆24Updated this week
- ☆13Updated 11 months ago