natlamir / LLaVA-Windows
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
☆23Updated last year
Alternatives and similar repositories for LLaVA-Windows:
Users that are interested in LLaVA-Windows are comparing it to the libraries listed below
- Animate a given image with animatediff and controlnet☆132Updated 3 months ago
- ☆178Updated 7 months ago
- Experimental use of stable-video-diffusion in ComfyUI☆161Updated 11 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆145Updated last year
- ☆78Updated last year
- Superprompt a 77M Parameter T5 custom trained checkpoint to make dull prompts detailed.☆68Updated 11 months ago
- LCM test nodes for comfyui☆62Updated last year
- ☆143Updated 2 weeks ago
- ComfyUI InstructIR☆71Updated 11 months ago
- LivePortrait for AUTOMATIC1111 Stable Diffusion WebUI☆71Updated 2 months ago
- ComfyUI wrapper node for original freecontrol diffusers implementation☆67Updated last year
- A Graphical userinterface "on top" of Deforum, that lets you control AI-generated video in real time☆72Updated 4 months ago
- ☆120Updated last year
- Set of Utilities I Have Coded to Help Me Train RPGv6 on Flux1☆79Updated 7 months ago
- ComfyUI implementation of FLATTEN☆107Updated 7 months ago
- ☆37Updated last year
- A system for Prompt generation to improve Text-to-Image performance.☆77Updated last month
- See original repo here: https://github.com/google/RB-Modulation - ICLR 2025 (Oral)☆125Updated 8 months ago
- The IMAGE-interrogator for SOTA image captioning☆82Updated last year
- Craft your visions☆138Updated last year
- ☆43Updated last year
- ZeST: Zero-Shot Material Transfer from a Single Image☆171Updated 9 months ago
- ComfyUI Implementaion of ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment☆160Updated last year
- Overlay text on an image in ComfyUI with font/alignment/placement customization☆56Updated 8 months ago
- AUTOMATIC1111 UI extension for creating videos synchronized to music.☆63Updated last year
- ☆180Updated last year
- ComfyUI Node for FlashFace☆68Updated 2 months ago
- ☆105Updated 10 months ago
- Diffusers Implementation of Controlling Text-to-Image Diffusion by Orthogonal Finetuning☆35Updated last year
- ☆42Updated 4 months ago