langgptai / Awesome-Multimodal-Prompts
Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
☆250Updated last year
Alternatives and similar repositories for Awesome-Multimodal-Prompts
Users that are interested in Awesome-Multimodal-Prompts are comparing it to the libraries listed below
Sorting:
- A curated list of awesome projects and resources related to autonomous AI agents.☆280Updated last year
- ☆66Updated last year
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs☆508Updated last year
- [CVPR 2025] Video Narration as Vocabulary & Video as Long Document☆569Updated 2 months ago
- Sora AI Awesome List – Your go-to resource hub for all things Sora AI, OpenAI's groundbreaking model for crafting realistic scenes from t…☆237Updated 3 months ago
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆357Updated last year
- [ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs☆433Updated 3 months ago
- ☆126Updated last year
- ☆249Updated last year
- ☆55Updated last year
- GPT4Tools is an intelligent system that can automatically decide, control, and utilize different visual foundation models, allowing the u…☆773Updated last year
- WebDesignAgent : Towards Effortless Website Creation☆250Updated 7 months ago
- [TLLM'23] PandaGPT: One Model To Instruction-Follow Them All☆785Updated last year
- ☆427Updated 7 months ago
- OpenGPTs- Powerful GPTs Colipot | 强大的gpts浏览器插件|多窗口|批量对话|chatgpt3.5|chatgpt4.0☆189Updated 9 months ago
- ☆176Updated 10 months ago
- MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU☆349Updated last year
- A simulation of world using GPTs. (depreciated)☆157Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆300Updated last year
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆139Updated 6 months ago
- Official Repo for the Paper: CHATANYTHING: FACETIME CHAT WITH LLM-ENHANCED PERSONAS☆382Updated last year
- [ECCV2024] 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming.☆287Updated 11 months ago
- 🤖 Awesome list of AGI Agents. Agents 精选资源合集.☆394Updated last year
- Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆88Updated last year
- 让 AI 设计 AI,让大模型帮助小模型进化,用魔法创造魔法! Empower Artificial Intelligence to sculpt its own kind, where colossal models gracefully usher the petit…☆96Updated last year
- Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"☆863Updated last week
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a…☆219Updated 2 weeks ago
- This repo includes all customized GPTs on openai gpt store☆119Updated last year
- ☆109Updated last year
- ☆51Updated 9 months ago