lxe / llavavision

A simple "Be My Eyes" web app with a llama.cpp/llava backend

☆490

Alternatives and similar repositories for llavavision

Users who are interested in llavavision are comparing it to the repositories listed below.

GRVYDEV / S.A.T.U.R.D.A.Y
A toolbox for working with WebRTC, Audio and AI
☆701 · Updated last year
okuvshynov / slowllama
Finetune llama2-70b and codellama on MacBook Air without quantization
☆447 · Updated last year
Ads97 / WhatsApp-Llama
Finetune an LLM to speak like you based on your WhatsApp conversations
☆366 · Updated last year
innovatorved / whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode…
☆894 · Updated last year
OneInterface / realtime-bakllava
llama.cpp with the BakLLaVA model describes what it sees
☆382 · Updated last year
fill3d / fill
Generative fill in 3D.
☆744 · Updated 6 months ago
pinokiocomputer / llamanet
Replace OpenAI with Llama.cpp Automagically.
☆320 · Updated last year
belladoreai / llama-tokenizer-js
JS tokenizer for LLaMA 1 and 2
☆354 · Updated last year
mezbaul-h / june
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
☆775 · Updated 11 months ago
bentoml / BentoDiffusion
BentoDiffusion: A collection of diffusion models served with BentoML
☆370 · Updated 2 months ago
iamarunbrahma / finetuned-qlora-falcon7b-medical
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
☆257 · Updated last year
NeumTry / NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
☆859 · Updated last year
jostmey / NakedAttention
Revealing example of self-attention, the building block of transformer AI models
☆131 · Updated 2 years ago
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆132 · Updated last year
bennyschmidt / next-token-prediction
Next-token prediction in JavaScript — build fast language and diffusion models.
☆142 · Updated 9 months ago
trzy / llava-cpp-server
LLaVA server (llama.cpp).
☆180 · Updated last year
ShaShekhar / aaiela
☆163 · Updated last year
bennyschmidt / ragdoll-studio
The creative suite for character-driven AI experiences.
☆185 · Updated 10 months ago
elfvingralf / macOSpilot-ai-assistant
Voice + Vision powered AI assistant that answers questions about any application, in context and in audio.
☆1,151 · Updated last year
aymenfurter / microagents
Agents Capable of Self-Editing Their Prompts / Python Code
☆769 · Updated last year
modal-labs / quillman
A voice chat app
☆1,146 · Updated last month
Dabble-Studio / 3d-to-photo
3D to Photo is an open-source package by Dabble, that combines threeJS and Stable diffusion to build a virtual photo studio for product p…
☆444 · Updated last year
fzliu / radient
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
☆278 · Updated 2 weeks ago
chris-alexiuk / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆362 · Updated 2 years ago
rogeriochaves / driver
☆125 · Updated last year
andyk / recursive_llm
Implement recursion using English as the programming language and an LLM as the runtime.
☆238 · Updated 2 years ago
arc53 / llm-price-compass
This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …
☆221 · Updated 7 months ago
kolinko / effort
An implementation of bucketMul LLM inference
☆220 · Updated last year
vlm-run / vlmrun-cookbook
Examples and guides for using the VLM Run API
☆281 · Updated last week
PsyChip / machina
OpenCV+YOLO+LLAVA powered video surveillance system
☆763 · Updated last month