Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
☆278Aug 18, 2025Updated 6 months ago
Alternatives and similar repositories for Awesome-Multimodal-Prompts
Users that are interested in Awesome-Multimodal-Prompts are comparing it to the libraries listed below
Sorting:
- 让 AI 设计 AI,让大模型帮助小模型进化,用魔法创造魔法! Empower Artificial Intelligence to sculpt its own kind, where colossal models gracefully usher the petit…☆98Oct 16, 2023Updated 2 years ago
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch☆17Dec 22, 2022Updated 3 years ago
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated 2 years ago
- ☆23Jan 16, 2024Updated 2 years ago
- ☆11Jan 13, 2024Updated 2 years ago
- 利用这个模板,你可以结构化的生成用于进行AI绘画创作的Prompt,适用于DALLE和MidJourney等多个平台。☆23Mar 8, 2024Updated 2 years ago
- My learning note in monash FIT course include fit9131 fit9132 fit9136 fit5032 fit5057 fit5136 fit5125☆15Nov 3, 2022Updated 3 years ago
- Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"☆48Sep 3, 2025Updated 6 months ago
- ☆12Jan 10, 2025Updated last year
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- 在微信端使用类似Trickle的图片信息识别和提炼,并进行图片信息管理的功能。☆81Sep 19, 2023Updated 2 years ago
- Awesome AI GPTs, OpenAI GPTs, GPT-4, ChatGPT, GPTs, Prompts, plugins, Prompts leaking☆1,178Jun 27, 2024Updated last year
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆1,065Dec 9, 2024Updated last year
- Convert matlab calibration parameter to opencv and ROS☆14Jul 4, 2021Updated 4 years ago
- All tools developed by myself for personal purposes.☆16Feb 1, 2026Updated last month
- ☆13May 9, 2022Updated 3 years ago
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆12Oct 27, 2024Updated last year
- QRCode scanner via WebRTC☆15Mar 11, 2014Updated 11 years ago
- The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"☆21Jul 21, 2025Updated 7 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs☆1,520Aug 19, 2024Updated last year
- Latest Advances on Multimodal Large Language Models☆17,416Updated this week
- 🎯 AI 游戏,编织代码、文字,如梦如幻,如诗如歌。☆365Nov 15, 2023Updated 2 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆126Nov 13, 2023Updated 2 years ago
- Lion: Kindling Vision Intelligence within Large Language Models☆51Jan 25, 2024Updated 2 years ago
- Great claude skills of everyone.☆37Nov 11, 2025Updated 3 months ago
- This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotl…☆25Sep 29, 2025Updated 5 months ago
- ☆13Nov 15, 2023Updated 2 years ago
- 🤖 GPT-4V Demos • Test the model's vision capabilities in your browser using Streamlit • Easy setup☆18Dec 3, 2023Updated 2 years ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year
- Official site for FRAMED. This site is generated from the Sitesource repo using DocNet 0.16.4 or higher.☆14Feb 3, 2026Updated last month
- ☆14Sep 24, 2021Updated 4 years ago
- Save a png or jpeg and option to save prompt/workflow in a text or json file for each image in Comfy + Workflow loading☆24Aug 14, 2023Updated 2 years ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Apr 13, 2022Updated 3 years ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Nov 10, 2025Updated 4 months ago
- 🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀☆5,550Oct 22, 2025Updated 4 months ago
- Build chatbots with GPT3. Write a text file, get a chat bot.☆16Nov 19, 2022Updated 3 years ago
- ☆17Feb 22, 2024Updated 2 years ago