langgptai / Awesome-Multimodal-Prompts
Prompts for GPT-4V & DALL-E3 to fully utilize their multimodal abilities. GPT4V Prompts, DALL-E3 Prompts.
☆275Updated 5 months ago
Alternatives and similar repositories for Awesome-Multimodal-Prompts
Users who are interested in Awesome-Multimodal-Prompts are comparing it to the libraries listed below
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs☆511Updated 2 years ago
- ☆66Updated 2 years ago
- [CVPR 2025] Video Narration as Vocabulary & Video as Long Document☆584Updated 10 months ago
- A curated list of awesome projects and resources related to autonomous AI agents.☆284Updated 2 years ago
- [TLLM'23] PandaGPT: One Model To Instruction-Follow Them All☆843Updated 2 years ago
- A curated list of awesome AGI frameworks, software and resources☆560Updated 2 years ago
- The next generation of Multi-Modal Multi-Agent platform.☆111Updated 8 months ago
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆367Updated 2 years ago
- This repo includes all customized GPTs on the OpenAI GPT Store☆120Updated 2 years ago
- Sora AI Awesome List – Your go-to resource hub for all things Sora AI, OpenAI's groundbreaking model for crafting realistic scenes from t…☆258Updated 11 months ago
- A simulation of a world using GPTs. (deprecated)☆158Updated last year
- ☆252Updated 2 years ago
- ☆483Updated last year
- [ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs☆482Updated last year
- CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative H…☆322Updated last year
- ControlLLM: Augment Language Models with Tools by Searching on Graphs☆194Updated last year
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆221Updated 7 months ago
- GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u…☆774Updated 2 years ago
- An open-source chatbot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆300Updated 2 years ago
- Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.☆270Updated 2 years ago
- ☆121Updated 2 years ago
- Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction…☆78Updated 2 years ago
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.☆203Updated last year
- HPT - Open Multimodal LLMs from HyperGAI☆315Updated last year
- Research Trends in LLM-guided Multimodal Learning.☆357Updated 2 years ago
- ☆132Updated 2 years ago
- Langchain implementation of HuggingGPT☆134Updated 2 years ago
- General video interaction platform based on LLMs, including Video ChatGPT☆254Updated 2 years ago
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆763Updated last year
- Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"☆864Updated 8 months ago