garrisonhess / llama2.cLinks
Inference Llama 2 in one file of pure C
☆14Updated 2 years ago
Alternatives and similar repositories for llama2.c
Users that are interested in llama2.c are comparing it to the libraries listed below
Sorting:
- ☆166Updated last year
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.☆425Updated 11 months ago
- Tiny inference-only implementation of LLaMA☆92Updated last year
- OpenAI compatible API for serving LLAMA-2 model☆218Updated 2 years ago
- Official Rust Implementation of Model2Vec☆152Updated last week
- Neural search for web-sites, docs, articles - online!☆146Updated 6 months ago
- Inference engine for GLiNER models, in Rust☆90Updated last month
- Semantic Indexer☆53Updated last year
- ☆157Updated 2 years ago
- hnsqlite integrates hnswlib and sqlite for simple text embedding search☆160Updated 2 years ago
- Rust framework for LLM orchestration☆203Updated last year
- ☆35Updated 2 years ago
- JS tokenizer for LLaMA 1 and 2☆363Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆76Updated 2 years ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆231Updated this week
- Fast approximate nearest neighbor searching in Rust, based on HNSW index☆343Updated last month
- ☆140Updated last year
- ☆135Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆141Updated last year
- Enforce structured output from LLMs 100% of the time☆250Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆203Updated last year
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆323Updated 2 years ago
- Neural Search☆367Updated 11 months ago
- An implementation of bucketMul LLM inference☆225Updated last year
- Rust client for txtai☆113Updated 3 weeks ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆58Updated 8 months ago
- Ask questions, let GPT do the SQL.☆133Updated 2 years ago
- ☆58Updated 2 years ago
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆94Updated 2 years ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated 2 years ago