natlamir / LLaVA-Windows
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards multimodal GPT-4 level capabilities.
☆23Updated last year
Alternatives and similar repositories for LLaVA-Windows:
Users that are interested in LLaVA-Windows are comparing it to the libraries listed below
- ☆42Updated last year
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆146Updated 11 months ago
- ☆173Updated 4 months ago
- ☆40Updated last month
- The IMAGE-interrogator for SOTA image captioning☆78Updated 9 months ago
- Training and generation / detection / inference scripts dealing with Yolov8☆53Updated 5 months ago
- ☆16Updated 11 months ago
- Animate a given image with animatediff and controlnet☆131Updated this week
- ☆82Updated 4 months ago
- ComfyUI InstructIR☆69Updated 7 months ago
- Inpaint Anything performs stable diffusion inpainting on a browser UI using masks from Segment Anything.☆246Updated 3 weeks ago
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.☆58Updated 2 months ago
- Code release: https://github.com/google/RB-Modulation☆125Updated 4 months ago
- ☆225Updated this week
- Workflow used in this video:☆36Updated 9 months ago
- Apply unlimited masks to unlimited LoRA models☆47Updated last year
- LivePortrait for AUTOMATIC1111 Stable Diffusion WebUI☆59Updated 2 weeks ago
- ☆120Updated last year
- ☆38Updated 7 months ago
- Text to Img with Stable Cascade(on gradio interface), required less vram than original example on official Hugginface☆40Updated 7 months ago
- Craft your visions☆135Updated last year
- Superprompt a 77M Parameter T5 custom trained checkpoint to make dull prompts detailed.☆68Updated 7 months ago
- An Extension for Automatic1111 Webui that increases/decreases the details of images☆83Updated 4 months ago
- An A1111 extension to let the AI make prompts for SD using Oobabooga☆106Updated 10 months ago
- Wrapper nodes for ComfyUI to use some ofthe DiffSynthStudio features☆59Updated 6 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆39Updated 9 months ago
- A system for Prompt generation to improve Text-to-Image performance.☆68Updated last week
- Florence-2 image captioning and tasks☆75Updated 6 months ago
- ☆28Updated last year
- ComfyUI Node for FlashFace☆66Updated 7 months ago