ashleykleynhans / llava-docker
Docker image for LLaVA: Large Language and Vision Assistant
☆1Updated 9 months ago
Alternatives and similar repositories for llava-docker:
Users that are interested in llava-docker are comparing it to the libraries listed below
- Implementation of Grounding DINO & Segment Anything, and it allows masking based on prompt, which is useful for programmed inpainting.☆38Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated last year
- Cog wrapper for Vchitect/SEINE☆37Updated last year
- ☆30Updated last year
- ☆46Updated last year
- A curated list of amazing RunPod projects, libraries, and resources☆111Updated 8 months ago
- Gradio UI for a Cog API☆67Updated last year
- LoRA inference model packaged with Cog☆74Updated last year
- All the world is a play, we are but actors in it.☆49Updated this week
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated last year
- Style-Transfer: Apply the style of an image to another image☆52Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated last year
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.☆30Updated last year
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.☆59Updated 5 months ago
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆58Updated last year
- ☆21Updated 5 months ago
- ☆50Updated 2 years ago
- SDXL Multi-controlnet with loras☆26Updated 9 months ago
- Community ComfyUI workflows running on fal.ai☆57Updated 7 months ago
- A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large …☆123Updated last year
- Running Ollama with Runpod☆59Updated 9 months ago
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year
- ☆79Updated last year
- Stable Fashion: A prompt based virtual try on repository☆87Updated 2 years ago
- Starting point to build your own custom serverless endpoint☆102Updated this week
- Implementations of zero-shot capabilities with Open AI's CLIP and computer vision models☆34Updated 7 months ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- Transfer the style of your video. Use on ClarityAI.co☆71Updated 9 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆86Updated last year
- ☆25Updated last year