NOVAglow646/LLM-MLLM-paper-list

NOVAglow646 / LLM-MLLM-paper-list

A paper list on LLMs and multimodal LLMs

☆60

Alternatives and similar repositories for LLM-MLLM-paper-list

Users who are interested in LLM-MLLM-paper-list are comparing it to the repositories listed below.

NOVAglow646 / OOD-Generalization-Paper-Reading-Notes
Reading notes on papers related to OOD generalization
☆37 · Dec 9, 2024 · Updated last year
weixuan-wang123 / SADI
☆19 · Sep 1, 2025 · Updated 10 months ago
itsqyh / Awesome-LMMs-Mechanistic-Interpretability
A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…
☆215 · Mar 4, 2026 · Updated 4 months ago
NishilBalar / Awesome-LVLM-Hallucination
Up-to-date curated list of state-of-the-art research work, papers, and resources on hallucination in large vision-language models
☆325 · Feb 8, 2026 · Updated 5 months ago
maxjcohen / vqvae
VQ-VAE implementation in PyTorch, supporting EMA and Gumbel training. Applicable to images and time series.
☆11 · Oct 19, 2022 · Updated 3 years ago
joaanna / disentangling_spelling_in_clip
☆36 · Jun 22, 2023 · Updated 3 years ago
LALBJ / PAI
[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
☆171 · Nov 6, 2024 · Updated last year
MLRM-Halu / MLRM-Halu
[NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models
☆82 · May 31, 2025 · Updated last year
shikiw / OPERA
[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…
☆411 · Aug 24, 2024 · Updated last year
luo-junyu / COUPLE
☆12 · Jul 31, 2024 · Updated last year
gccnlp / Light-PEFT
[ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
☆13 · Sep 2, 2024 · Updated last year
gszfwsb / AutoGnothi
Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"
☆23 · Mar 4, 2025 · Updated last year
allenai / hyperdecoders
Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304
☆14 · Oct 11, 2022 · Updated 3 years ago
adobe-research / llava-score
☆11 · Oct 2, 2024 · Updated last year
AntResearchNLP / AlignXplore
Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals
☆11 · Jan 8, 2026 · Updated 6 months ago
underdoc-wang / EAST-Net
[AAAI'22] Event-Aware Multimodal Mobility Nowcasting
☆14 · Sep 12, 2022 · Updated 3 years ago
snap-research / VIMI
☆13 · Jul 10, 2024 · Updated 2 years ago
YiyangZhou / LURE
[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
☆158 · Apr 30, 2024 · Updated 2 years ago
sdc17 / CopT
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning
☆18 · May 21, 2026 · Updated 2 months ago
kayzliu / godm
Data Augmentation for Supervised Graph Outlier Detection with Latent Diffusion Models
☆15 · Sep 3, 2025 · Updated 10 months ago
WilliamZR / ProTrix
Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
☆17 · Nov 15, 2024 · Updated last year
AIM3-RUC / VideoIC
Danmaku dataset
☆12 · Jul 7, 2023 · Updated 3 years ago
xyltt / LPT
This repo contains the code for Late Prompt Tuning.
☆12 · Dec 22, 2025 · Updated 7 months ago
locuslab / llava-token-compression
☆47 · Nov 8, 2024 · Updated last year
yao8839836 / cp
☆13 · Feb 17, 2025 · Updated last year
iOPENCap / awesome-unimodal-training
Text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)
☆12 · Oct 15, 2024 · Updated last year
gauss5930 / iDUS
An unofficial implementation of the SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS), with implementation and experiment details.
☆14 · Mar 20, 2024 · Updated 2 years ago
deepneuralmachine / seq2act-tensorflow
Seq2act: Mapping Natural Language Instructions to Mobile UI Action Sequences, from Google Research
☆15 · Jul 13, 2020 · Updated 6 years ago
isXinLiu / Awesome-MLLM-Safety
Accepted by IJCAI-24 Survey Track
☆233 · Aug 25, 2024 · Updated last year
nickjiang2378 / vlm-hallucinations
[ICLR '25] Official PyTorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"
☆105 · Nov 30, 2025 · Updated 7 months ago
96-Zachary / vse_2ad
☆15 · Apr 30, 2022 · Updated 4 years ago
uvavision / SyViC
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13 · Sep 30, 2023 · Updated 2 years ago
yuanpinz / awesome-deep-multimodal-reasoning
A collection of awesome works revolving around reasoning models like O1/R1 in the visual domain
☆55 · Jul 21, 2025 · Updated last year
YeeZ93 / Awesome-Object-Centric-Learning
A curated list of research on object-centric learning
☆11 · Oct 14, 2024 · Updated last year
fc2869 / lo-fit
LoFiT: Localized Fine-tuning on LLM Representations
☆45 · Jan 15, 2025 · Updated last year
X-PLUG / mPLUG-HalOwl
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
☆100 · Jan 29, 2024 · Updated 2 years ago
MCG-NKU / SERE
Exploring Feature Self-relation for Self-supervised Transformer (TPAMI 2023)
☆21 · Apr 30, 2025 · Updated last year
FrankYang-17 / RealUnify
☆27 · Oct 10, 2025 · Updated 9 months ago
zyf12389 / LayoutGAN-Alpha
Implementation of LayoutGAN https://arxiv.org/abs/1901.06767
☆17 · May 12, 2019 · Updated 7 years ago