onehr / llama-rs
Run LLaMA inference on CPU, with Rust 🦀🚀🦙
☆20Updated 2 months ago
Alternatives and similar repositories for llama-rs:
Users that are interested in llama-rs are comparing it to the libraries listed below
- 8-bit floating point types for Rust☆46Updated last week
- A diffusers API in Burn (Rust)☆19Updated 8 months ago
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆38Updated last year
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆34Updated 10 months ago
- ☆31Updated 4 months ago
- Rust bindings to https://github.com/leejet/stable-diffusion.cpp☆16Updated 2 weeks ago
- A minimal OpenCL, CUDA, Vulkan and host CPU array manipulation engine / framework.☆73Updated this week
- A Rust Vector which swaps to disk based on given parameters☆44Updated last year
- GGML bindings that aim to be idiomatic Rust rather than directly corresponding to the C/C++ interface☆19Updated last year
- GPU based FFT written in Rust and CubeCL☆20Updated 2 weeks ago
- Tensor library for Zig☆11Updated 4 months ago
- AI Assistant☆20Updated 2 weeks ago
- Bleeding edge low level Rust binding for GGML☆16Updated 9 months ago
- A set of Rust macros for working with OpenAI function/tool calls.☆46Updated 11 months ago
- Asynchronous CUDA for Rust.☆31Updated 4 months ago
- Library for doing RAG☆70Updated last week
- Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle☆72Updated 11 months ago
- ☆19Updated 5 months ago
- Graph model execution API for Candle☆13Updated 4 months ago
- A neural network inference library, written in Rust.☆61Updated 8 months ago
- Rust Vector for large amounts of data, that does not copy when growing, by using full `mmap`'d pages.☆22Updated last year
- next generation vector db built with lmdb bindings☆12Updated last week
- A collection of optimisers for use with candle☆34Updated 4 months ago
- image segmentation on video and images☆47Updated last year
- A very fast, accurate and wasm ready port of Diff Match Patch in Rust. The diff implementation is based on Myers' diff algorithm.☆33Updated last month
- Rustic bindings for IREE☆18Updated 2 years ago
- llm_utils: Basic LLM tools, best practices, and minimal abstraction.☆42Updated last month
- Rust embedded scripting languages benchmark☆59Updated this week
- Rust standalone inference of Namo-500M series models. Extremly tiny, runing VLM on CPU.☆22Updated 2 weeks ago
- Low rank adaptation (LoRA) for Candle.☆144Updated 7 months ago