ntu-nail / CE7455Links
☆37Updated last month
Alternatives and similar repositories for CE7455
Users that are interested in CE7455 are comparing it to the libraries listed below
Sorting:
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆55Updated 9 months ago
- OpenReivew Submission Visualization (ICLR 2024/2025)☆151Updated 7 months ago
- ☆80Updated 2 months ago
- ☆38Updated last week
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆74Updated 6 months ago
- ☆131Updated 2 weeks ago
- [ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"☆237Updated last year
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆86Updated last year
- ☆16Updated last month
- Visualizing the attention of vision-language models☆176Updated 3 months ago
- ☆19Updated last month
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆32Updated 4 months ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆30Updated 7 months ago
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat…☆78Updated 3 months ago
- 😎 curated list of awesome LMM hallucinations papers, methods & resources.☆149Updated last year
- CHAIR metric is a rule-based metric for evaluating object hallucination in caption generation.☆29Updated last year
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆34Updated last month
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues☆40Updated 2 weeks ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆38Updated 11 months ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆25Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆203Updated 6 months ago
- Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)☆73Updated last week
- ☆17Updated 3 years ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆83Updated 7 months ago
- Paper List of Inference/Test Time Scaling/Computing☆239Updated last week
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆214Updated this week
- [ICLR 2024] Code for the paper "Sparse MoE with Language-Guided Routing for Multilingual Machine Translation"☆9Updated last year
- TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆42Updated last week
- ☆53Updated 7 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆68Updated 3 months ago