garrisonhess / llama2.c
Inference Llama 2 in one file of pure C
☆14Updated last year
Related projects ⓘ
Alternatives and complementary repositories for llama2.c
- Tiny inference-only implementation of LLaMA☆91Updated 7 months ago
- Rust implementation of Surya☆51Updated last month
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆80Updated 9 months ago
- Semantic Indexer☆51Updated last month
- An ecosystem of Rust libraries for working with large language models☆11Updated last year
- Neural search for web-sites, docs, articles - online!☆127Updated 3 weeks ago
- Locality Sensitive Hashing☆70Updated last year
- Rust client for txtai☆106Updated 3 weeks ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆73Updated last year
- ☆162Updated 5 months ago
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆55Updated last year
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆65Updated 3 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆63Updated 11 months ago
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- Tantivy directory implementation backed by object_store☆27Updated 9 months ago
- Generate BM25 sparse vector inside PostgreSQL☆53Updated last week
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated last year
- ☆57Updated last year
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆37Updated last year
- LLM plugin for clustering embeddings☆61Updated 8 months ago
- open source tooling for AI search and understanding☆49Updated last year
- ☆153Updated last year
- ☆10Updated last year
- Vector Database with support for late interaction and token level embeddings.☆53Updated last month
- GGML implementation of BERT model with Python bindings and quantization.☆51Updated 8 months ago
- ☆136Updated 8 months ago
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆71Updated last year
- A SQLite extension for working with float and binary vectors. Work in progress!☆19Updated last year
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆26Updated 6 months ago
- Modular Rust transformer/LLM library using Candle☆36Updated 6 months ago