Rust+OpenCL+AVX2 implementation of LLaMA inference code
☆554Feb 12, 2024Updated 2 years ago
Alternatives and similar repositories for rllama
Users that are interested in rllama are comparing it to the libraries listed below
Sorting:
- [Unmaintained, see README] An ecosystem of Rust libraries for working with large language models☆6,150Jun 24, 2024Updated last year
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆94Sep 2, 2023Updated 2 years ago
- Deep learning in Rust, with shape checked tensors and neural networks☆1,896Jul 23, 2024Updated last year
- `llm-chain` is a powerful rust crate for building chains in large language models allowing you to summarise text and complete complex tas…☆1,592Oct 31, 2024Updated last year
- A fast llama2 decoder in pure Rust.☆1,061Nov 30, 2023Updated 2 years ago
- Bleeding edge low level Rust binding for GGML☆16Jun 26, 2024Updated last year
- An implementation of the diffusers api in Rust☆586Apr 4, 2024Updated last year
- Inference Llama 2 in one file of pure Rust 🦀☆235Sep 11, 2023Updated 2 years ago
- Llama2 LLM ported to Rust burn☆280Apr 16, 2024Updated last year
- Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.☆14,473Updated this week
- Rust bindings for the C++ api of PyTorch.☆5,302Jan 22, 2026Updated last month
- Minimalist ML framework for Rust☆19,509Updated this week
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!☆111Jul 27, 2023Updated 2 years ago
- LLama.cpp rust bindings☆415Jun 27, 2024Updated last year
- A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web☆1,745Jul 21, 2024Updated last year
- Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)