langgptai/Awesome-Multimodal-Prompts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/langgptai/Awesome-Multimodal-Prompts)

langgptai / Awesome-Multimodal-Prompts

Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.

☆288

Alternatives and similar repositories for Awesome-Multimodal-Prompts

Users that are interested in Awesome-Multimodal-Prompts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EmbraceAGI / AutoNetGen
View on GitHub
让 AI 设计 AI，让大模型帮助小模型进化，用魔法创造魔法！ Empower Artificial Intelligence to sculpt its own kind, where colossal models gracefully usher the petit…
☆97Oct 16, 2023Updated 2 years ago
IDEA-Research / hana
View on GitHub
Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch
☆17Dec 22, 2022Updated 3 years ago
EmbraceAGI / Awesome-AI-GPTs
View on GitHub
Awesome AI GPTs, OpenAI GPTs, GPT-4, ChatGPT, GPTs, Prompts, plugins, Prompts leaking
☆1,191Jun 27, 2024Updated 2 years ago
RL4M / MED-PEFT
View on GitHub
☆22Jan 16, 2024Updated 2 years ago
yzfly / TokenCode
View on GitHub
为 Token 燃烧而生：token 越来越便宜，质量永远稀缺——与其省 token，不如把它当燃料烧。同一道题派最多 1000 个 agent 并行竞赛、裁判择优，用冗余换质量。Go 写的开源终端 Coding Agent，类 Claude Code，可接入任意模型，对团…
☆32Jul 7, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
EmbraceAGI / AIGoodGames
View on GitHub
🎯 AI 游戏，编织代码、文字，如梦如幻，如诗如歌。
☆377Nov 15, 2023Updated 2 years ago
zhutyler21 / AIPainting-Structured-Prompts
View on GitHub
利用这个模板，你可以结构化的生成用于进行AI绘画创作的Prompt，适用于DALLE和MidJourney等多个平台。
☆23Mar 8, 2024Updated 2 years ago
YukunLi99 / AdaptSAM
View on GitHub
☆22Jun 30, 2023Updated 3 years ago
ddupont808 / GPT-4V-Act
View on GitHub
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
☆1,059Dec 9, 2024Updated last year
inhai-wiki / Trickle-On-WeChat
View on GitHub
在微信端使用类似Trickle的图片信息识别和提炼，并进行图片信息管理的功能。
☆81Sep 19, 2023Updated 2 years ago
mynameischaos / Lion
View on GitHub
Lion: Kindling Vision Intelligence within Large Language Models
☆51Jan 25, 2024Updated 2 years ago
BradyFU / Awesome-Multimodal-Large-Language-Models
View on GitHub
Latest Advances on Multimodal Large Language Models
☆17,959Updated this week
langgptai / wonderful-prompts
View on GitHub
🔥中文 prompt 精选🔥，ChatGPT 使用指南，提升 ChatGPT 可玩性和可用性！🚀
☆6,206Oct 22, 2025Updated 9 months ago
GeWu-Lab / Valuate-and-Enhance-Multimodal-Cooperation
View on GitHub
The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024
☆62Nov 5, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
microsoft / SoM
View on GitHub
[arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs
☆1,551Aug 19, 2024Updated last year
WangRongsheng / Slides-Reports-and-papers
View on GitHub
⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文
☆12Oct 27, 2024Updated last year
GeWu-Lab / awesome-audiovisual-learning
View on GitHub
A curated list of audio-visual learning methods and datasets.
☆289Dec 3, 2024Updated last year
ritaranx / BMRetriever
View on GitHub
[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
☆26Sep 19, 2024Updated last year
EmbraceAGI / Duo-Cai-You-Xi
View on GitHub
☆97May 26, 2024Updated 2 years ago
RupertLuo / VoCoT
View on GitHub
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
☆79Jul 13, 2024Updated 2 years ago
LvXinTao / Mocap-to-SMPLX
View on GitHub
This repository is for fitting mocap data into SMPL-X parameters in [CVPR 2024]"Inter-X: Towards Versatile Human-Human Interaction Analys…
☆49Jun 1, 2024Updated 2 years ago
yaotingwangofficial / Awesome-MCoT
View on GitHub
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
☆1,016May 22, 2026Updated 2 months ago
DwanZhang-AI / SePPO
View on GitHub
Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."
☆18Oct 7, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
langgptai / LangGPT
View on GitHub
LangGPT: Empowering everyone to become a prompt expert! 🚀 📌 结构化提示词（Structured Prompt）提出者 📌 元提示词（Meta-Prompt）发起者 📌 最流行的提示词落地范式 | La…
☆12,385Jul 16, 2026Updated last week
dkimlab / MCMED
View on GitHub
☆48May 16, 2025Updated last year
EmbraceAGI / Awesome-AGI
View on GitHub
A curated list of awesome AGI frameworks, software and resources
☆573Sep 27, 2023Updated 2 years ago
baaivision / Emu
View on GitHub
Emu Series: Generative Multimodal Models from BAAI
☆1,776Jan 12, 2026Updated 6 months ago
AILab-CVC / SEED-Bench
View on GitHub
(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.
☆366Jan 14, 2025Updated last year
Alpha-VLLM / WeMix-LLM
View on GitHub
☆17Oct 15, 2023Updated 2 years ago
markus-suchi / 3D-DAT
View on GitHub
3D Scene Annotation and Dataset Toolkit
☆10Jun 11, 2023Updated 3 years ago
feifeibear / PyTorchMemTracer
View on GitHub
Depict GPU memory footprint during DNN training of PyTorch
☆11Nov 17, 2022Updated 3 years ago
JIA-Lab-research / LLaMA-VID
View on GitHub
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
☆861Jul 29, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
aszala / VPEval
View on GitHub
VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆45Nov 29, 2023Updated 2 years ago
THUIR / THUIR-website
View on GitHub
THUIR website
☆10Feb 23, 2026Updated 5 months ago
GasolSun36 / MVP
View on GitHub
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
☆24Sep 9, 2024Updated last year
z-x-yang / DoraemonGPT
View on GitHub
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
☆91Jun 19, 2026Updated last month
SCUT-DLVCLab / GPT-4V_OCR
View on GitHub
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
☆128Nov 13, 2023Updated 2 years ago
langgptai / awesome-claude-prompts
View on GitHub
This repo includes Claude prompt curation to use Claude better.
☆5,369Feb 28, 2026Updated 5 months ago
fleek-platform / persona-generator
View on GitHub
The persona-generator is a library designed to transform user input (natural language processing) into structured JSON files representing…
☆16May 5, 2025Updated last year