monatis / clip.cpp
CLIP inference in plain C/C++ with no extra dependencies
☆456Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for clip.cpp
- LLaVA server (llama.cpp).☆177Updated last year
- Python bindings for ggml☆132Updated 2 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆229Updated 6 months ago
- Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)☆557Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech☆719Updated this week
- ggml implementation of BERT☆464Updated 8 months ago
- LLM-based code completion engine☆173Updated last year
- Port of Meta's Encodec in C/C++☆199Updated 2 weeks ago
- ☆501Updated last week
- Official implementation of Half-Quadratic Quantization (HQQ)☆698Updated last week
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…☆307Updated 9 months ago
- SoTA Transformers with C-backend for fast inference on your CPU.☆312Updated 11 months ago
- C++ implementation for BLOOM☆811Updated last year
- GGUF implementation in C as a library and a tools CLI program☆242Updated 4 months ago
- A ggml (C++) re-implementation of tortoise-tts☆155Updated 2 months ago
- Inference of Mamba models in pure C☆177Updated 8 months ago
- Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than…☆1,048Updated last month
- ☆1,258Updated last year
- FlashAttention (Metal Port)☆382Updated last month
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript☆550Updated 4 months ago
- A pytorch quantization backend for optimum☆818Updated this week
- Pure C++ implementation of several models for real-time chatting on your computer (CPU)☆374Updated this week
- Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation☆251Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆244Updated 9 months ago
- ☆465Updated 2 months ago
- LLM-powered lossless compression tool☆252Updated 2 months ago
- The repository for the code of the UltraFastBERT paper☆514Updated 7 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆86Updated this week
- An innovative library for efficient LLM inference via low-bit quantization☆348Updated 2 months ago
- ☆527Updated 9 months ago