lxe / llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
☆490Updated last year
Alternatives and similar repositories for llavavision
Users who are interested in llavavision are comparing it to the repositories listed below:
- llama.cpp with the BakLLaVA model describes what it sees☆383Updated last year
- Fine-tune an LLM to speak like you based on your WhatsApp conversations☆365Updated last year
- Finetune llama2-70b and codellama on MacBook Air without quantization☆447Updated last year
- A toolbox for working with WebRTC, Audio and AI☆697Updated last year
- Mistral7B playing DOOM☆132Updated 11 months ago
- ☆125Updated last year
- BentoDiffusion: A collection of diffusion models served with BentoML☆369Updated last month
- Agents Capable of Self-Editing Their Prompts / Python Code☆768Updated last year
- 3D to Photo is an open-source package by Dabble that combines threeJS and Stable Diffusion to build a virtual photo studio for product p…☆444Updated last year
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆857Updated last year
- Replace OpenAI with Llama.cpp Automagically.☆318Updated last year
- Crawls a Multi-Page Application into a zip file and serves the Multi-Page Application from the zip file. An MPA archiver. Could be used as a Sit…☆477Updated last week
- Next-token prediction in JavaScript — build fast language and diffusion models.☆143Updated 9 months ago
- Generative fill in 3D.☆743Updated 6 months ago
- ☆278Updated 10 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆774Updated 10 months ago
- Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)☆567Updated last year
- 👀🧠 GPT-4 Vision x 💪⌨️ Vimium = Autonomous Web Agent☆169Updated last year
- The creative suite for character-driven AI experiences.☆186Updated 9 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.☆1,613Updated 10 months ago
- A fast and minimal framework for building agentic systems☆428Updated 10 months ago
- (Cross-Platform) An open source approach to locally record and enable searching everything you view on any computer.☆278Updated last year
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode…☆894Updated last year
- Create API agents from OpenAPI Specs☆183Updated last year
- RAG based tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for ra…☆676Updated 5 months ago
- Voice + Vision powered AI assistant that answers questions about any application, in context and in audio.☆1,150Updated last year
- The procedure and the code to run shap-e sample code locally.☆116Updated 2 years ago
- An implementation of bucketMul LLM inference☆217Updated 11 months ago
- LLaVA server (llama.cpp).☆180Updated last year
- ☆163Updated 11 months ago