gotzmann / boosterLinks

Booster - open accelerator for LLM models. Better inference and debugging for AI hackers

☆158

Alternatives and similar repositories for booster

Users that are interested in booster are comparing it to the libraries listed below

Sorting:

go-skynet / go-ggml-transformers.cpp
Binding to transformers in ggml
☆63Updated 2 months ago
mudler / go-stable-diffusion
☆16Updated last year
cornelk / llama-go
Port of Facebook's LLaMA (Large Language Model Meta AI) in Golang with embedded C/C++
☆168Updated 2 years ago
acheong08 / vectordb
A simple vector database: Text encoding, semantic search, document storage
☆91Updated 2 years ago
zenmodel / zenmodel
ZenModel is a framework for building LLM applications with agentic workflow
☆71Updated 8 months ago
tmc / go-llama2
Llama 2 inference in one file of pure Go
☆105Updated last year
seasonjs / stable-diffusion
pure go for stable-diffusion and support cross-platform.
☆57Updated last year
nuance1979 / llama-server
LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.
☆127Updated 2 years ago
saracen / llama2.go
Inference Llama 2 in Go
☆39Updated last year
donomii / go-rwkv.cpp
A go wrapper around the rwkv.cpp library
☆20Updated last year
EinStack / glide
🐦 A open blazing-fast simple model gateway for rapid development of production GenAI apps
☆151Updated 11 months ago
haormj / llama2.go
Inference Llama 2 in one file of pure go
☆16Updated last year
tmc / righthand
RightHand - A GPT4 powered assistive tool.
☆113Updated 6 months ago
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆246Updated last year
mzbac / mlx-lora
☆38Updated last year
c0sogi / llama-api
An OpenAI-like LLaMA inference API
☆112Updated last year
gomlx / go-huggingface
Conveniently download files, models, tokenizers from HuggingFace Hub
☆29Updated 2 weeks ago
trzy / llava-cpp-server
LLaVA server (llama.cpp).
☆180Updated last year
sugarme / transformer
NLP transformers written in Go
☆234Updated 2 years ago
iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆55Updated last year
wangcx18 / llm-vscode-inference-server
An endpoint server for efficiently serving quantized open-source LLMs for code.
☆55Updated last year
IntrinsicLabsAI / gbnfgen
TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces
☆137Updated last year
mmatongo / chew
Chew is a Go library for processing various content types into markdown/plaintext.
☆42Updated 4 months ago
Maximilian-Winter / llama_cpp_function_calling
☆31Updated last year
neuml / txtai.go
Go client for txtai
☆79Updated last month
ChuloAI / oasis
Local LLaMAs/Models in VSCode
☆53Updated 2 years ago
bold84 / cot_proxy
Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…
☆48Updated last month
pluja / maestro
Turn natual language into commands. Your CLI tasks, now as easy as a conversation. Run it 100% offline, or use OpenAI's models.
☆59Updated last year
distantmagic / structured
Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp
☆45Updated last year
xyzhang626 / embeddings.cpp
ggml implementation of embedding models including SentenceTransformer and BGE
☆58Updated last year