langgptai / Awesome-Multimodal-PromptsLinks
Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
☆256Updated 2 weeks ago
Alternatives and similar repositories for Awesome-Multimodal-Prompts
Users that are interested in Awesome-Multimodal-Prompts are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Video Narration as Vocabulary & Video as Long Document☆575Updated 5 months ago
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs☆512Updated 2 years ago
- ☆66Updated 2 years ago
- [ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs☆466Updated 7 months ago
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆142Updated 10 months ago
- ControlLLM: Augment Language Models with Tools by Searching on Graphs☆193Updated last year
- This repo includes all customized GPTs on openai gpt store☆118Updated last year
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆366Updated last year
- Sora AI Awesome List – Your go-to resource hub for all things Sora AI, OpenAI's groundbreaking model for crafting realistic scenes from t…☆243Updated 7 months ago
- ☆132Updated last year
- A curated list of awesome projects and resources related to autonomous AI agents.☆286Updated last year
- Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.☆271Updated last year
- The next generation of Multi-Modal Multi-Agent platform.☆106Updated 3 months ago
- ☆182Updated 3 weeks ago
- HPT - Open Multimodal LLMs from HyperGAI☆315Updated last year
- open-o1: Using GPT-4o with CoT to Create o1-like Reasoning Chains☆116Updated 7 months ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆301Updated 2 years ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆216Updated 2 months ago
- 🤖 Awesome list of AGI Agents. Agents 精选资源合集.☆461Updated last year
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆278Updated last year
- CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative H…☆318Updated last year
- A simulation of world using GPTs. (depreciated)☆157Updated last year
- ☆92Updated last year
- Search, organize, discover anything!☆48Updated last year
- [TLLM'23] PandaGPT: One Model To Instruction-Follow Them All☆817Updated 2 years ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆124Updated 9 months ago
- ☆79Updated last year
- GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u…☆774Updated last year
- ☆65Updated last year
- ☆121Updated last year