nomic-ai / kompute
General-purpose GPU compute framework built on Vulkan to support thousands of cross-vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous, and optimized for advanced GPU data-processing use cases. Backed by the Linux Foundation.
☆41 · Updated last month
Related projects
Alternatives and complementary repositories for kompute
- GGML implementation of the BERT model with Python bindings and quantization. ☆51 · Updated 9 months ago
- GPT-2 implementation in C++ using Ort. ☆25 · Updated 3 years ago
- Port of Suno AI's Bark in C/C++ for fast inference. ☆54 · Updated 7 months ago
- Course project for COMP4471 on RWKV. ☆16 · Updated 9 months ago
- llama.cpp fork with additional SOTA quants and improved performance. ☆93 · Updated this week
- Asynchronous/distributed speculative evaluation for Llama 3. ☆37 · Updated 3 months ago
- Inference of Mamba models in pure C. ☆178 · Updated 8 months ago
- tinygrad port of the RWKV large language model. ☆43 · Updated 5 months ago
- ggml implementation of embedding models, including SentenceTransformer and BGE. ☆52 · Updated 11 months ago
- Stable Diffusion in pure C/C++. ☆60 · Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml. ☆87 · Updated 9 months ago
- ☆43 · Updated 4 months ago
- instinct.cpp provides ready-to-use alternatives to the OpenAI Assistant API and built-in utilities for developing AI agent applications (RAG, …). ☆37 · Updated 4 months ago
- The Next Generation Multi-Modality Superintelligence. ☆70 · Updated 2 months ago
- Python bindings for ggml. ☆132 · Updated 2 months ago
- vLLM: a high-throughput and memory-efficient inference and serving engine for LLMs. ☆89 · Updated this week
- Testing LLM reasoning abilities with family-relationship quizzes. ☆42 · Updated this week
- Train your own small BitNet model. ☆56 · Updated last month
- RWKV in nanoGPT style. ☆177 · Updated 5 months ago
- Some simple scripts for day-to-day work with LLMs and the Hugging Face Hub. ☆155 · Updated last year
- ☆40 · Updated last year
- Inference of Llama/Llama 2 models in NumPy. ☆20 · Updated 11 months ago
- An all-new language model that processes ultra-long sequences of 100,000+ tokens, ultra-fast. ☆137 · Updated 2 months ago
- Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks. ☆31 · Updated 6 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆66 · Updated last year
- RWKV-7: Surpassing GPT. ☆45 · Updated this week
- Experiments with BitNet inference on CPU. ☆50 · Updated 7 months ago
- New optimizer. ☆19 · Updated 3 months ago
- An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO. ☆22 · Updated this week
- A faithful clone of Karpathy's llama2.c (one-file inference, zero dependencies) but fully functional with LLaMA 3 8B base and instruct mode… ☆51 · Updated 3 months ago