camenduru / minigpt4Links
☆28Updated 2 years ago
Alternatives and similar repositories for minigpt4
Users that are interested in minigpt4 are comparing it to the libraries listed below
Sorting:
- ☆120Updated 2 years ago
- ☆82Updated 2 years ago
- Grounding DINO with Segment Anything & Stable Diffusion colab☆196Updated last year
- Instruct-tune LLaMA on consumer hardware☆73Updated 2 years ago
- ☆204Updated last year
- Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion☆49Updated last year
- ☆114Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated last year
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting".☆41Updated last year
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆152Updated 9 months ago
- Diffusion WebUI: Stable Diffusion + ControlNet + Inpaint☆53Updated 2 years ago
- A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.☆65Updated 2 years ago
- Image Editing Anything☆116Updated 2 years ago
- 8-bit CUDA functions for PyTorch☆43Updated 2 years ago
- Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4☆27Updated 2 years ago
- ☆135Updated 2 years ago
- ☆88Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆141Updated 7 months ago
- ☆31Updated 2 years ago
- ☆31Updated last year
- Modern Stable Diffusion models family - Fluently☆32Updated last year
- qwen create prompt for sdxl☆34Updated last year
- Awesome repo for ControlNet☆97Updated 2 years ago
- ☆119Updated last year
- WebUI extension for ControlNet, supports LoRA version of ControlNet☆109Updated 2 years ago
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆142Updated 10 months ago
- 支持Taiyi-Diffusion-XL模型的Fooocus☆20Updated last year
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆40Updated 2 years ago
- ImageSlider custom component for gradio.☆42Updated last year
- webui for HCP-Diffusion☆139Updated 2 years ago