ashleykleynhans / llava-docker
Docker image for LLaVA: Large Language and Vision Assistant
☆1Updated 7 months ago
Alternatives and similar repositories for llava-docker:
Users who are interested in llava-docker are comparing it to the repositories listed below.
- Implementation of Grounding DINO & Segment Anything that allows prompt-based masking, which is useful for programmatic inpainting.☆37Updated last year
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless workers provided by RunPod as endpoints.☆57Updated last year
- Gradio UI for a Cog API☆66Updated 10 months ago
- A Gradio-based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.☆57Updated 3 months ago
- How to Build an AI Children’s Book Service☆24Updated last year
- A serverless application that uses AnimateDiff to run a Text-to-Video task on RunPod.☆16Updated 11 months ago
- Basic framework for training Dreambooth Stable Diffusion v1.5 on Banana's v1.0 serverless GPU platform☆37Updated 2 years ago
- Attempt at a Cog wrapper for an SDXL CLIP Interrogator☆10Updated 9 months ago
- ☆18Updated 10 months ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated last year
- Packaged version of OOTDiffusion☆21Updated 7 months ago
- 🚀 | A simple worker that can be used as a starting point to build your own custom RunPod Endpoint API worker.☆93Updated 3 months ago
- A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.☆117Updated 3 weeks ago
- Cog wrapper for Vchitect/SEINE☆37Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated 11 months ago
- 4-bit bitsandbytes quants of the best 7B VLMs☆27Updated 4 months ago
- ☆30Updated last year
- ☆78Updated last year
- ☆43Updated 3 months ago
- ☆22Updated 2 months ago
- SD3 DreamBooth LoRA training book, adapted from the diffusers doc☆42Updated 8 months ago
- ☆38Updated 4 months ago
- ☆30Updated 2 years ago
- RunPod worker for Stable Diffusion XL☆27Updated 3 months ago
- SDXL Multi-controlnet with loras☆26Updated 7 months ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Updated last year
- Unofficial FastAPI implementation of Stable-Diffusion API☆80Updated 2 years ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆81Updated last year
- Implementation of a Discord channel scraper to generate datasets.☆75Updated 8 months ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated last year