trzy / llava-cpp-serverLinks

LLaVA server (llama.cpp).

☆183

Alternatives and similar repositories for llava-cpp-server

Users that are interested in llava-cpp-server are comparing it to the libraries listed below

Sorting:

herrera-luis / vision-core-ai
Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.
☆46Updated 2 years ago
monatis / lmm.cpp
Inference of Large Multimodal Models in C/C++. LLaVA and others
☆48Updated 2 years ago
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆100Updated 2 years ago
mzbac / mlx-moe
Scripts to create your own moe models using mlx
☆90Updated last year
sumo43 / loopvlm
run paligemma in real time
☆133Updated last year
abetlen / ggml-python
Python bindings for ggml
☆146Updated last year
togethercomputer / redpajama.cpp
Extend the original llama.cpp repo to support redpajama model.
☆118Updated last year
OneInterface / realtime-bakllava
llama.cpp with BakLLaVA model describes what does it see
☆381Updated 2 years ago
NousResearch / Obsidian
Maybe the new state of the art vision model? we'll see 🤷‍♂️
☆167Updated last year
sshh12 / multi_token
Embed arbitrary modalities (images, audio, documents, etc) into large language models.
☆187Updated last year
ggerganov / bark.cpp
Port of Suno AI's Bark in C/C++ for fast inference
☆53Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
PABannier / biogpt.cpp
Port of Microsoft's BioGPT in C/C++ using ggml
☆85Updated last year
mzbac / mlx-lora
☆38Updated last year
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆248Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆68Updated 3 weeks ago
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆146Updated 2 years ago
4dh / GRDN
GRDN.AI app for garden optimization
☆69Updated 2 weeks ago
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆68Updated last year
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆170Updated last year
kayvr / token-hawk
WebGPU LLM inference tuned by hand
☆151Updated 2 years ago
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆222Updated last year
monatis / clip.cpp
CLIP inference in plain C/C++ with no extra dependencies
☆540Updated 5 months ago
multiplexerai / Complex-to-Simple-RAG
☆40Updated last year
skeskinen / llama-lite
Embeddings focused small version of Llama NLP model
☆107Updated 2 years ago
danielgross / ggml-k8s
Run GGML models with Kubernetes.
☆175Updated last year
teknium1 / ShareGPT-Builder
☆117Updated 11 months ago
abacaj / replit-3B-inference
Run inference on replit-3B code instruct model using CPU
☆160Updated 2 years ago
Birch-san / falcon-play
Command-line script for inferencing from models such as falcon-7b-instruct
☆75Updated 2 years ago
balisujohn / tortoise.cpp
A ggml (C++) re-implementation of tortoise-tts
☆192Updated last year