alwynpan / uom-comp90024
Demo Code for Subject COMP90024
☆12 · Updated 5 months ago
Alternatives and similar repositories for uom-comp90024
Users interested in uom-comp90024 are comparing it to the repositories listed below.
- Project Description ☆22 · Updated last year
- ☆79 · Updated last year
- Official repo for the paper "[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs" ☆22 · Updated 4 months ago
- [CVPR 2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…" ☆28 · Updated 2 months ago
- ☆67 · Updated 9 months ago
- Survey: https://arxiv.org/pdf/2507.20198 ☆121 · Updated last week
- SmartCLIP: A training method to improve CLIP with both short and long texts ☆19 · Updated 2 months ago
- [ICML 2024 Spotlight] "Sample-specific Masks for Visual Reprogramming-based Prompting" ☆12 · Updated 8 months ago
- [ACM MM 2025] TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos ☆74 · Updated last month
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024) ☆90 · Updated 10 months ago
- Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning ☆36 · Updated 2 months ago
- Survey on LLM Inference via Search (TMLR 2025) ☆10 · Updated 3 months ago
- The official repo for "Where do Large Vision-Language Models Look at when Answering Questions?" ☆41 · Updated 3 months ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models" ☆11 · Updated 6 months ago
- TrackGPT: Track What You Need in Videos via Text Prompts ☆25 · Updated 2 years ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key ☆71 · Updated 2 months ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding ☆65 · Updated 2 months ago
- ✨ A curated list of papers on uncertainty in multimodal large language models (MLLMs) ☆53 · Updated 5 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality ☆38 · Updated last month
- Official implementation of MC-LLaVA ☆139 · Updated last week
- PyTorch implementation for "Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning" (ICML 2024) ☆22 · Updated 3 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆79 · Updated last year
- [NeurIPS 2024] Repos for the "Visualization-of-Thought" dataset, construction code, and evaluation ☆32 · Updated 10 months ago
- Code for "Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models" ☆16 · Updated 10 months ago
- Collections of Papers and Projects for Multimodal Reasoning ☆105 · Updated 4 months ago
- [ICML 2025] Official implementation of the paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" ☆151 · Updated 2 months ago
- A curated collection and survey of vision-language model papers and model GitHub repositories; continuously updated ☆352 · Updated this week
- Official implementation of "Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology"☆60Updated last month
- [LLaVA-Video-R1]✨First Adaptation of R1 to LLaVA-Video (2025-03-18)☆30Updated 3 months ago
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆102Updated 10 months ago