natlamir / LLaVA-WindowsLinks
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
☆23Updated last year
Alternatives and similar repositories for LLaVA-Windows
Users that are interested in LLaVA-Windows are comparing it to the libraries listed below
Sorting:
- Training and generation / detection / inference scripts dealing with Yolov8☆63Updated 9 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆146Updated last year
- LCM test nodes for comfyui☆63Updated last year
- ☆121Updated last year
- Animate a given image with animatediff and controlnet☆132Updated 4 months ago
- ☆43Updated last year
- An image viewer and AI-assisted editing/captioning/masking tool that helps with curating datasets for generative AI models, finetunes and…☆125Updated last week
- ☆177Updated 9 months ago
- An A1111 extension to let the AI make prompts for SD using Oobabooga☆105Updated last year
- Craft your visions☆137Updated last year
- Apply unlimited masks to unlimited LoRA models☆48Updated last year
- ComfyUI Node for FlashFace☆68Updated 3 months ago
- ☆143Updated last month
- ☆28Updated last year
- ComfyUI Implementaion of ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆159Updated last year
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆31Updated 5 months ago
- ☆44Updated 6 months ago
- Superprompt a 77M Parameter T5 custom trained checkpoint to make dull prompts detailed.☆68Updated last year
- Stable Diffusion WebUI extension for GPT4V-Image-Captioner☆102Updated 4 months ago
- ComfyUI InstructIR☆71Updated last year
- ComfyUI implementation of Long-CLIP☆147Updated 2 months ago
- ☆62Updated last year
- Experimental use of stable-video-diffusion in ComfyUI☆161Updated last year
- AUTOMATIC1111 UI extension for creating videos synchronized to music.☆62Updated 2 years ago
- ☆85Updated 9 months ago
- ☆79Updated last year
- ☆185Updated last year
- eesahe's ComfyUI Flux ControlNet Utils☆69Updated 9 months ago
- ComfyUI implementation of FLATTEN☆108Updated 9 months ago
- See original repo here: https://github.com/google/RB-Modulation - ICLR 2025 (Oral)☆126Updated 9 months ago