trzy / llava-cpp-server
LLaVA server (llama.cpp).
☆173Updated 10 months ago
Related projects: ⓘ
- WebGPU LLM inference tuned by hand☆145Updated last year
- Python bindings for ggml☆125Updated 2 weeks ago
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 2 weeks ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated 11 months ago
- Local ML voice chat using high-end models.☆138Updated last week
- Command-line script for inferencing from models such as MPT-7B-Chat☆102Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago
- Full finetuning of large language models without large memory requirements☆94Updated 8 months ago
- run paligemma in real time☆122Updated 4 months ago
- MLX-VLM is a package for running Vision LLMs locally on your Mac using MLX.☆187Updated this week
- Inference of Mamba models in pure C☆176Updated 6 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆97Updated 4 months ago
- ☆101Updated 5 months ago
- A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.☆219Updated 2 months ago
- Scripts to create your own moe models using mlx☆86Updated 6 months ago
- CLIP inference in plain C/C++ with no extra dependencies☆433Updated last month
- Falcon LLM ggml framework with CPU and GPU support☆245Updated 7 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆62Updated 7 months ago
- A pipeline parallel training script for LLMs.☆79Updated 3 weeks ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated last year
- A fast batching API to serve LLM models☆172Updated 4 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated 6 months ago
- GRDN.AI app for garden optimization☆68Updated 7 months ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆170Updated 5 months ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆154Updated 8 months ago
- ☆133Updated 9 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated 4 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 3 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆157Updated this week