FFengIll / embedding.cppLinks
ggml implementation of BERT Embedding
☆26Updated 2 years ago
Alternatives and similar repositories for embedding.cpp
Users that are interested in embedding.cpp are comparing it to the libraries listed below
Sorting:
- GGML implementation of BERT model with Python bindings and quantization.☆58Updated last year
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev…☆53Updated 2 years ago
- ggml implementation of embedding models including SentenceTransformer and BGE☆63Updated 2 years ago
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆57Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- Booster - open accelerator for LLM models. Better inference and debugging for AI hackers☆167Updated last year
- Sentence Transformers API: An OpenAI compatible embedding API server☆70Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆76Updated 2 years ago
- Embedding models from Jina AI☆65Updated 2 years ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆42Updated 7 months ago
- Python bindings for ggml☆147Updated last year
- AirLLM 70B inference with single 4GB GPU☆17Updated 7 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated 2 years ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated 7 months ago
- ☆24Updated last year
- BlinkDL's RWKV-v4 running in the browser☆48Updated 2 years ago
- Rust implementation of Surya☆65Updated 11 months ago
- A SQLite extension for generating text embeddings from GGUF models using llama.cpp☆244Updated last year
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆51Updated 11 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆66Updated 9 months ago
- llama.cpp gguf file parser for javascript☆50Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module☆60Updated 2 months ago
- ☆51Updated last year
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated last year
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆48Updated 2 years ago
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provi…☆41Updated 10 months ago
- Sentence Embedding as a Service☆15Updated 7 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆86Updated last year
- A guidance compatibility layer for llama-cpp-python☆36Updated 2 years ago
- Thin wrapper around GGML to make life easier☆42Updated 3 months ago