langgptai / Awesome-Multimodal-PromptsView external linksLinks
Prompts of GPT-4V & DALL-E3 to full utilize the multi-modal ability. GPT4V Prompts, DALL-E3 Prompts.
☆276Aug 18, 2025Updated 5 months ago
Alternatives and similar repositories for Awesome-Multimodal-Prompts
Users that are interested in Awesome-Multimodal-Prompts are comparing it to the libraries listed below
Sorting:
- Prompts for Music Generation☆35Sep 17, 2023Updated 2 years ago
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch☆17Dec 22, 2022Updated 3 years ago
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- ☆23Jan 16, 2024Updated 2 years ago
- My learning note in monash FIT course include fit9131 fit9132 fit9136 fit5032 fit5057 fit5136 fit5125☆15Nov 3, 2022Updated 3 years ago
- Pytorch implementation of NASA: NEURAL ARTICULATED SHAPE APPROXIMATION☆12May 4, 2021Updated 4 years ago
- ☆16Feb 9, 2026Updated last week
- A curated list of audio-visual learning methods and datasets.☆285Dec 3, 2024Updated last year
- Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"☆48Sep 3, 2025Updated 5 months ago
- official code for "3D Question Answering via only 2D Vision-Language Models"☆23Jan 15, 2026Updated last month
- ☆12Jan 10, 2025Updated last year
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- Awesome AI GPTs, OpenAI GPTs, GPT-4, ChatGPT, GPTs, Prompts, plugins, Prompts leaking☆1,177Jun 27, 2024Updated last year
- All tools developed by myself for personal purposes.☆16Feb 1, 2026Updated 2 weeks ago
- Convert matlab calibration parameter to opencv and ROS☆14Jul 4, 2021Updated 4 years ago
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆1,066Dec 9, 2024Updated last year
- The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"☆21Jul 21, 2025Updated 6 months ago
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆12Oct 27, 2024Updated last year
- QRCode scanner via WebRTC☆15Mar 11, 2014Updated 11 years ago
- [arXiv 2023] Set-of-Mark Prompting for GPT-4V and LMMs☆1,515Aug 19, 2024Updated last year
- Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date…☆1,679Updated this week
- Latest Advances on Multimodal Large Language Models☆17,337Feb 7, 2026Updated last week
- a simple pytorch implementation of diffusiom model☆13Mar 20, 2023Updated 2 years ago
- Lion: Kindling Vision Intelligence within Large Language Models☆51Jan 25, 2024Updated 2 years ago
- Great claude skills of everyone.☆36Nov 11, 2025Updated 3 months ago
- Official site for FRAMED. This site is generated from the Sitesource repo using DocNet 0.16.4 or higher.☆14Feb 3, 2026Updated last week
- 🤖 GPT-4V Demos • Test the model's vision capabilities in your browser using Streamlit • Easy setup☆18Dec 3, 2023Updated 2 years ago
- This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotl…☆24Sep 29, 2025Updated 4 months ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year
- Hidden cost extractor for SEC filings.☆18Mar 1, 2022Updated 3 years ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…☆37Apr 13, 2022Updated 3 years ago
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆45Nov 29, 2023Updated 2 years ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Nov 10, 2025Updated 3 months ago
- 🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀☆5,503Oct 22, 2025Updated 3 months ago
- LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)☆859Jul 29, 2024Updated last year
- Official Code for "Intelligent Painter: Picture Composition With Resampling Diffusion Model" (ICIP 2023)☆17Jun 23, 2023Updated 2 years ago
- ☆17Feb 22, 2024Updated last year
- CLAUDE.md Context Analyzer - A comprehensive tool for analyzing and managing Claude Code memory files☆31Jun 6, 2025Updated 8 months ago