THUNLP-MT/Scaffold

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/THUNLP-MT/Scaffold)

THUNLP-MT / Scaffold

Scaffold Prompting to promote LMMs

☆46

Alternatives and similar repositories for Scaffold

Users that are interested in Scaffold are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yixuan730 / DetToolChain
View on GitHub
Dettoolchain: A new prompting paradigm to unleash detection ability of MLLM
☆45Oct 12, 2024Updated last year
THUNLP-MT / CODIS
View on GitHub
Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
☆13Oct 14, 2024Updated last year
THUNLP-MT / ActiView
View on GitHub
☆11Dec 20, 2024Updated last year
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12Jul 28, 2025Updated 11 months ago
amberxie88 / lapp
View on GitHub
Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)
☆27Sep 1, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Aurora-slz / MM-Verify
View on GitHub
☆19Oct 28, 2025Updated 8 months ago
chancharikmitra / CCoT
View on GitHub
[CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"
☆142Jun 20, 2024Updated 2 years ago
claws-lab / projection-in-MLLMs
View on GitHub
Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'
☆18Jul 21, 2024Updated 2 years ago
kj3moraes / movieclip
View on GitHub
An experiment with movie scenes and contrastive learning
☆11Feb 1, 2025Updated last year
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
liyongqi67 / LTRGR
View on GitHub
☆21Aug 9, 2024Updated last year
ac-rad / anyplace
View on GitHub
Official implementation of "AnyPlace: Learning Generalized Object Placement for Robot Manipulation"
☆96Mar 25, 2025Updated last year
aszala / EnvGen
View on GitHub
Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)
☆40Jul 13, 2024Updated 2 years ago
MedHK23 / IMT-CXR
View on GitHub
☆20Jan 3, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
luckybird1994 / SAMCOD
View on GitHub
☆35Apr 14, 2023Updated 3 years ago
NVlabs / FRAG
View on GitHub
☆15Apr 25, 2025Updated last year
Liqq1 / awesome-medical-vision-and-language-pretraining
View on GitHub
The collection of medical VLP papars
☆20Jul 24, 2024Updated last year
ExplainableML / ImageSelect
View on GitHub
Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
☆27Jul 10, 2023Updated 3 years ago
UCSC-VLAA / ReasoningEval
View on GitHub
Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
☆43Jun 6, 2025Updated last year
YiyangZhou / CSR
View on GitHub
[NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models
☆87Oct 26, 2025Updated 8 months ago
metthueshoo / graphCut
View on GitHub
Graph Cut Algorithm in CUDA
☆28Jun 1, 2019Updated 7 years ago
MLLMKCBENCH / MLLMKC
View on GitHub
【AAAI 2026 🔥】A benchmark that evaluates multimodel knowledge conflicts for large multimodal model
☆25May 27, 2025Updated last year
Lackel / AGLA
View on GitHub
[CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
☆68Jul 16, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
ritaranx / BMRetriever
View on GitHub
[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
☆26Sep 19, 2024Updated last year
Liu-Jinxin / ur5e_joystick_control
View on GitHub
☆10Dec 15, 2024Updated last year
mbzuai-oryx / Video-CoM
View on GitHub
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
☆22Jun 17, 2026Updated last month
ChantalMP / RaDialog_v2
View on GitHub
LLaVa Version of RaDialog
☆26May 27, 2025Updated last year
meetdavidwan / crg
View on GitHub
PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"
☆39Mar 4, 2024Updated 2 years ago
alexanderswerdlow / faster
View on GitHub
☆29Jun 30, 2026Updated 3 weeks ago
ZhuXMMM / Afford-X-Project
View on GitHub
☆17Mar 10, 2025Updated last year
ys-zong / VL-ICL
View on GitHub
[ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning
☆69Sep 20, 2025Updated 10 months ago
wjhou / Recap
View on GitHub
[EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning
☆28Jun 12, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
UK-MAC / TeaLeaf
View on GitHub
A mini-app to solve the heat conduction equation
☆15Jul 1, 2020Updated 6 years ago
82magnolia / panoramic-depth-calibration
View on GitHub
Official PyTorch implementation of Calibrating Panoramic Depth Estimation for Practical Localization and Mapping (ICCV 2023).
☆10Oct 9, 2025Updated 9 months ago
SALT-NLP / PopupAttack
View on GitHub
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
☆51Dec 23, 2024Updated last year
wentaoyuan / RoboPoint
View on GitHub
A Vision-Language Model for Spatial Affordance Prediction in Robotics
☆227Jul 17, 2025Updated last year
AgentForceTeamOfficial / Baby-AIGS
View on GitHub
Official Implementation of the Baby-AIGS system
☆24Nov 25, 2024Updated last year
AdaCheng / EgoThink
View on GitHub
[CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Lan…
☆64Mar 25, 2025Updated last year
simonfuhrmann / firepass
View on GitHub
A password manager
☆12Jun 22, 2026Updated 3 weeks ago