ashleykleynhans / llava-docker
Docker image for LLaVA: Large Language and Vision Assistant
☆1Updated 3 weeks ago
Alternatives and similar repositories for llava-docker
Users who are interested in llava-docker are comparing it to the repositories listed below
- Create amazing Stable Diffusion prompts with minimal prompt knowledge. A Vicuna-based prompt engineering tool for Stable Diffusion☆90Updated 2 years ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated last year
- Implementation of Grounding DINO & Segment Anything that allows masking based on a prompt, which is useful for programmed inpainting.☆38Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆66Updated last year
- Cog wrapper for Vchitect/SEINE☆37Updated last year
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless workers provided by RunPod as endpoints.☆57Updated 2 weeks ago
- Unofficial FastAPI implementation of Stable-Diffusion API☆81Updated 2 years ago
- Use text-to-image models Stable Diffusion, DALL-E2, DALL-E3, SDXL, SSD-1B, Kandinsky-2.2, and LCM from UI. Add images directly to your da…☆33Updated last year
- ☆79Updated last year
- Gradio UI for a Cog API☆66Updated last year
- A curated list of amazing RunPod projects, libraries, and resources☆115Updated 9 months ago
- ☆39Updated 3 weeks ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆46Updated last year
- RunPod Serverless Worker for Oobabooga Text Generation API for LLMs☆2Updated last year
- An OpenAI-like LLaMA inference API☆112Updated last year
- ☆95Updated last year
- ☆52Updated last year
- Record a sample of your own voice and let AI narrate the text in your own voice.☆80Updated last year
- Modal LLM llama.cpp-based model deployment as part of a series on Model as a Service (MaaS)☆16Updated 4 months ago
- HuggingChat-like UI in Gradio☆70Updated 2 years ago
- A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.☆31Updated last year
- ☆55Updated last year
- Demo Python script app to interact with a llama.cpp server using the Whisper API, microphone, and webcam devices.☆45Updated last year
- All the world is a play, we are but actors in it.☆50Updated this week
- A real-time video caption to conversation bot that captures frames, generates captions, and creates conversational responses using a Large …☆122Updated last year
- ☆30Updated last year
- Starting point to build your own custom serverless endpoint☆107Updated 3 weeks ago
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated last year
- ☆221Updated last year
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆18Updated last year