langgptai / Awesome-Multimodal-PromptsLinks
Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
☆255Updated last year
Alternatives and similar repositories for Awesome-Multimodal-Prompts
Users that are interested in Awesome-Multimodal-Prompts are comparing it to the libraries listed below
Sorting:
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs☆511Updated last year
- [CVPR 2025] Video Narration as Vocabulary & Video as Long Document☆572Updated 4 months ago
- ☆66Updated 2 years ago
- [ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs☆450Updated 5 months ago
- A curated list of awesome projects and resources related to autonomous AI agents.☆282Updated last year
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆141Updated 8 months ago
- This repo includes all customized GPTs on openai gpt store☆119Updated last year
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆364Updated last year
- ControlLLM: Augment Language Models with Tools by Searching on Graphs☆193Updated last year
- Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.☆273Updated last year
- HPT - Open Multimodal LLMs from HyperGAI☆316Updated last year
- ☆130Updated last year
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.☆197Updated 10 months ago
- ☆442Updated 9 months ago
- [TLLM'23] PandaGPT: One Model To Instruction-Follow Them All☆803Updated 2 years ago
- 🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VL…☆717Updated 2 months ago
- Sora AI Awesome List – Your go-to resource hub for all things Sora AI, OpenAI's groundbreaking model for crafting realistic scenes from t…☆241Updated 5 months ago
- The next generation of Multi-Modal Multi-Agent platform.☆100Updated 2 months ago
- Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆90Updated 2 years ago
- Langchain implementation of HuggingGPT☆132Updated 2 years ago
- Here are the Top 100 prompts on GPTStore, which we can use to learn and improve prompt engineering.☆531Updated last year
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆278Updated last year
- GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u…☆775Updated last year
- CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative H…☆318Updated last year
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆125Updated 8 months ago
- 🤖 Awesome list of AGI Agents. Agents 精选资源合集.☆433Updated last year
- LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。☆307Updated 3 weeks ago
- ☆249Updated last year
- Official Repo for the Paper: CHATANYTHING: FACETIME CHAT WITH LLM-ENHANCED PERSONAS☆382Updated last year
- The implementation of "Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4"☆158Updated last year