monatis / clip.cppLinks
CLIP inference in plain C/C++ with no extra dependencies
☆549Updated 7 months ago
Alternatives and similar repositories for clip.cpp
Users that are interested in clip.cpp are comparing it to the libraries listed below
Sorting:
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆306Updated last year
- ggml implementation of BERT☆498Updated last year
- Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)☆568Updated 2 years ago
- Python bindings for ggml☆147Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆854Updated last year
- ☆1,282Updated 2 years ago
- SoTA Transformers with C-backend for fast inference on your CPU.☆311Updated 2 years ago
- LLM-based code completion engine☆190Updated last year
- Port of Meta's Encodec in C/C++☆227Updated last year
- C++ implementation for BLOOM☆809Updated 2 years ago
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…☆313Updated 2 years ago
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- GGUF implementation in C as a library and a tools CLI program☆301Updated 5 months ago
- Inference of Mamba and Mamba2 models in pure C☆196Updated 2 weeks ago
- Python bindings for llama.cpp☆198Updated 2 years ago
- Falcon LLM ggml framework with CPU and GPU support☆249Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts☆193Updated last year
- ☆716Updated last year
- Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than…☆1,215Updated 3 months ago
- TTS support with GGML☆218Updated 4 months ago
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,563Updated 10 months ago
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆785Updated this week
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.☆189Updated last year
- Pybind11 bindings for Whisper.cpp☆344Updated last year
- It is a simple library to speed up CLIP inference up to 3x (K80 GPU)☆231Updated 2 years ago
- FlashAttention (Metal Port)☆579Updated last year
- Official implementation of Half-Quadratic Quantization (HQQ)☆912Updated last month
- The repository for the code of the UltraFastBERT paper☆519Updated last year
- C++ implementation for 💫StarCoder☆459Updated 2 years ago
- ☆1,029Updated 2 years ago