FFengIll / embedding.cpp
ggml implementation of BERT Embedding
☆24Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for embedding.cpp
- GGML implementation of BERT model with Python bindings and quantization.☆51Updated 8 months ago
- ggml implementation of embedding models including SentenceTransformer and BGE☆52Updated 10 months ago
- Training a reward model for RLHF using RWKV.☆14Updated last year
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retriev…☆39Updated 10 months ago
- ☆16Updated 5 months ago
- Inference Llama/Llama2 Modes in NumPy☆19Updated 11 months ago
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆37Updated 4 months ago
- ☆52Updated 5 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆86Updated this week
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- Sentence Embedding as a Service☆14Updated last year
- First token cutoff sampling inference example☆28Updated 9 months ago
- Implementation of nougat that focuses on processing pdf locally.☆73Updated 6 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆131Updated 3 months ago
- LLM inference server implementation based on llama.cpp.☆25Updated this week
- Trying to deconstruct RWKV in understandable terms☆14Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference☆54Updated 6 months ago
- ☆53Updated 2 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆41Updated last month
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- Python API for https://vespa.ai, the open big data serving engine☆101Updated this week
- A fast RWKV Tokenizer written in Rust☆36Updated 2 months ago
- Download full or partial git-lfs repos without temporarily using 2x disk space☆30Updated last year
- tinygrad port of the RWKV large language model.☆43Updated 4 months ago
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆152Updated this week
- ☆37Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM☆42Updated 6 months ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆73Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated 8 months ago