monatis / clip.cpp

CLIP inference in plain C/C++ with no extra dependencies

☆456

Related projects ⓘ

Alternatives and complementary repositories for clip.cpp

trzy / llava-cpp-server
LLaVA server (llama.cpp).
☆177Updated last year
abetlen / ggml-python
Python bindings for ggml
☆132Updated 2 months ago
staghado / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆229Updated 6 months ago
Maknee / minigpt4.cpp
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
☆557Updated last year
PABannier / bark.cpp
Suno AI's Bark model in C/C++ for fast text-to-speech
☆719Updated this week
skeskinen / bert.cpp
ggml implementation of BERT
☆464Updated 8 months ago
ggml-org / p1
LLM-based code completion engine
☆173Updated last year
PABannier / encodec.cpp
Port of Meta's Encodec in C/C++
☆199Updated 2 weeks ago
Cornell-RelaxML / quip-sharp
☆501Updated last week
mobiusml / hqq
Official implementation of Half-Quadratic Quantization (HQQ)
☆698Updated last week
harrisonvanderbyl / rwkv-cpp-accelerated
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…
☆307Updated 9 months ago
NolanoOrg / cformers
SoTA Transformers with C-backend for fast inference on your CPU.
☆312Updated 11 months ago
NouamaneTazi / bloomz.cpp
C++ implementation for BLOOM
☆811Updated last year
antirez / gguf-tools
GGUF implementation in C as a library and a tools CLI program
☆242Updated 4 months ago
balisujohn / tortoise.cpp
A ggml (C++) re-implementation of tortoise-tts
☆155Updated 2 months ago
kroggen / mamba.c
Inference of Mamba models in pure C
☆177Updated 8 months ago
unum-cloud / uform
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than…
☆1,048Updated last month
YavorGIvanov / sam.cpp
☆1,258Updated last year
philipturner / metal-flash-attention
FlashAttention (Metal Port)
☆382Updated last month
alasdairforsythe / tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
☆550Updated 4 months ago
huggingface / optimum-quanto
A pytorch quantization backend for optimum
☆818Updated this week
foldl / chatllm.cpp
Pure C++ implementation of several models for real-time chatting on your computer (CPU)
☆374Updated this week
symisc / tiny-dream
Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation
☆251Updated last year
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆244Updated 9 months ago
apoorvumang / prompt-lookup-decoding
☆465Updated 2 months ago
AlexBuz / llama-zip
LLM-powered lossless compression tool
☆252Updated 2 months ago
pbelcak / UltraFastBERT
The repository for the code of the UltraFastBERT paper
☆514Updated 7 months ago
ikawrakow / ik_llama.cpp
llama.cpp fork with additional SOTA quants and improved performance
☆86Updated this week
intel / neural-speed
An innovative library for efficient LLM inference via low-bit quantization
☆348Updated 2 months ago
Vahe1994 / SpQR
☆527Updated 9 months ago