lxe / llavavisionLinks
A simple "Be My Eyes" web app with a llama.cpp/llava backend
☆488Updated last year
Alternatives and similar repositories for llavavision
Users that are interested in llavavision are comparing it to the libraries listed below
Sorting:
- Finetune llama2-70b and codellama on MacBook Air without quantization☆447Updated last year
- llama.cpp with BakLLaVA model describes what does it see☆383Updated last year
- Generative fill in 3D.☆741Updated 5 months ago
- Finetune a LLM to speak like you based on your WhatsApp Conversations☆363Updated last year
- A toolbox for working with WebRTC, Audio and AI☆695Updated last year
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆767Updated 9 months ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode…☆891Updated last year
- ☆125Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,606Updated 10 months ago
- BentoDiffusion: A collection of diffusion models served with BentoML☆366Updated last month
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆858Updated last year
- 👀🧠 GPT-4 Vision x 💪⌨️ Vimium = Autonomous Web Agent☆170Updated last year
- Next-token prediction in JavaScript — build fast language and diffusion models.☆143Updated 8 months ago
- Instruct-tune LLaMA on consumer hardware☆362Updated 2 years ago
- Replace OpenAI with Llama.cpp Automagically.☆318Updated 11 months ago
- ☆744Updated last year
- ☆162Updated 11 months ago
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆323Updated 2 years ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆221Updated 5 months ago
- JS tokenizer for LLaMA 1 and 2☆350Updated 11 months ago
- OpenCV+YOLO+LLAVA powered video surveillance system☆760Updated 3 months ago
- Ctrl-f for videos☆271Updated last year
- Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)☆567Updated last year
- ☆277Updated 9 months ago
- Mistral7B playing DOOM☆131Updated 10 months ago
- Full stack voice chatbot☆197Updated 7 months ago
- A fast and minimal framework for building agentic systems☆426Updated 10 months ago
- The creative suite for character-driven AI experiences.☆184Updated 8 months ago
- Transform JSON objects using vector embeddings☆422Updated 10 months ago
- Create API agents from OpenAPI Specs☆182Updated last year