srush / llama2.rs
A fast llama2 decoder in pure Rust.
☆1,005 Updated 9 months ago
Related projects:
- Rust+OpenCL+AVX2 implementation of LLaMA inference code ☆537 Updated 7 months ago
- LLama.cpp rust bindings ☆320 Updated 2 months ago
- LLM Orchestrator built in Rust ☆261 Updated 6 months ago
- A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web ☆1,605 Updated last month
- Inference Llama 2 in one file of pure Rust 🦀 ☆227 Updated last year
- Efficient platform for inference and serving local LLMs including an OpenAI-compatible API server. ☆229 Updated 3 weeks ago
- A cross-platform browser ML framework. ☆567 Updated last week
- [Unmaintained, see README] An ecosystem of Rust libraries for working with large language models ☆6,063 Updated 2 months ago
- An implementation of the diffusers api in Rust ☆522 Updated 5 months ago
- Tutorial for Porting PyTorch Transformer Models to Candle (Rust) ☆235 Updated last month
- Fast ML inference & training for Rust with ONNX Runtime ☆802 Updated this week
- Pure Rust implementation of a minimal Generative Pretrained Transformer ☆817 Updated last week
- Deep learning in Rust, with shape checked tensors and neural networks ☆1,710 Updated last month
- 🦀 A curated list of Rust tools, libraries, and frameworks for working with LLMs, GPT, AI ☆236 Updated 6 months ago
- Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...) ☆2,582 Updated last month
- Inference Llama 2 in one file of pure 🔥 ☆2,091 Updated 4 months ago
- Rust bindings to https://github.com/ggerganov/whisper.cpp ☆655 Updated last week
- Blazingly fast LLM inference. ☆3,429 Updated this week
- A blazing fast inference solution for text embeddings models ☆2,599 Updated this week
- Rust library for OpenAI ☆1,111 Updated last week
- `llm-chain` is a powerful rust crate for building chains in large language models allowing you to summarise text and complete complex tas… ☆1,303 Updated last month
- 🦜️🔗LangChain for Rust, the easiest way to write LLM-based programs in Rust ☆511 Updated last week
- Deep learning at the speed of light. ☆1,441 Updated last month
- Llama2 LLM ported to Rust burn ☆272 Updated 5 months ago
- High-level, optionally asynchronous Rust bindings to llama.cpp ☆161 Updated 3 months ago
- LSP server leveraging LLMs for code completion (and more?) ☆587 Updated this week
- Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B. ☆373 Updated 6 months ago
- Library for generating vector embeddings, reranking in Rust ☆250 Updated 3 weeks ago
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model ☆1,403 Updated last month
- Rust bindings for the C++ api of PyTorch. ☆4,193 Updated this week