gudehhh666 / Awesome_Scientific_AgentLinks

Awesome Scientific Agent

☆19

Alternatives and similar repositories for Awesome_Scientific_Agent

Users that are interested in Awesome_Scientific_Agent are comparing it to the libraries listed below

Sorting:

saccharomycetes / mllms_know
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
☆230Updated 3 months ago
Video-R1 / Awesome-Multimodal-Reasoning
Collections of Papers and Projects for Multimodal Reasoning.
☆105Updated 2 months ago
Wild-Cooperation-Hub / Awesome-MLLM-Reasoning-Benchmarks
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
☆66Updated 4 months ago
OpenDCAI / Awesome_MLLMs_Reasoning
☆102Updated last week
The-Martyr / Awesome-Modality-Priors-in-MLLMs
Latest Advances on Modality Priors in Multimodal Large Language Models
☆21Updated this week
PromptExpert / blogs
☆58Updated 4 months ago
ADaM-BJTU / Mind_with_eyes_Awesome_MLLMs_Reasoning
This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!
☆47Updated 3 months ago
zhaochen0110 / Awesome_Think_With_Images
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…
☆698Updated 2 weeks ago
jungao1106 / ICoT
[CVPR' 25] Interleaved-Modal Chain-of-Thought
☆62Updated 2 months ago
Wang-Xiaodong1899 / CVPR25-MLLM-Paper-List
🔥CVPR 2025 Multimodal Large Language Models Paper List
☆147Updated 4 months ago
mrwu-mac / ControlMLLM
[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'
☆184Updated this week
Purshow / Awesome-LVLM-Hallucination
☆48Updated 7 months ago
HKUST-LongGroup / Awesome-MLLM-Benchmarks
☆129Updated 5 months ago
1zhou-Wang / MemVR
[ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…
☆141Updated last week
NOVAglow646 / LLM-MLLM-paper-list
关于LLM和Multimodal LLM的paper list
☆41Updated 3 weeks ago
DAMO-NLP-SG / VCD
[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
☆296Updated 9 months ago
Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs
This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…
☆998Updated this week
xinyan-cxy / MINT-CoT
☆60Updated last month
chancharikmitra / CCoT
[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"
☆133Updated last year
Osilly / Awesome-Interleaving-Reasoning
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
☆84Updated last week
taishan1994 / llava-handbook
对llava官方代码的一些学习笔记
☆28Updated 9 months ago
deepcs233 / Visual-CoT
[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought …
☆345Updated 6 months ago
dvlab-research / VisionZip
Official repository for VisionZip (CVPR 2025)
☆321Updated last month
RupertLuo / VoCoT
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
☆69Updated last year
EIT-NLP / Layer_Select_Fuse_for_MLLM
[CVPR2025] Official implementation of the paper "Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practi…
☆25Updated last month
lhanchao777 / LVLM-Hallucinations-Survey
This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and contin…
☆73Updated 11 months ago
daixiangzi / Awesome-Token-Compress
A paper list of some recent works about Token Compress for Vit and VLM
☆547Updated last week
shikiw / Awesome-MLLM-Hallucination
Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)
☆94Updated 7 months ago
shufangxun / LLaVA-MoD
[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
☆183Updated 3 months ago
Code-kunkun / LamRA
[CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
☆128Updated 2 weeks ago