LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!
☆111Jul 27, 2023Updated 2 years ago
Alternatives and similar repositories for llama-dfdx
Users that are interested in llama-dfdx are comparing it to the libraries listed below
Sorting:
- Bleeding edge low level Rust binding for GGML☆16Jun 26, 2024Updated last year
- Asynchronous CUDA for Rust.☆37Sep 18, 2025Updated 5 months ago
- Safe rust wrapper around CUDA toolkit☆1,066Feb 27, 2026Updated last week
- An n-dimensional array library that uses wgpu to run compute shaders on all wgpu backends (and multiple at once)☆31May 25, 2020Updated 5 years ago
- Llama2 LLM ported to Rust burn☆280Apr 16, 2024Updated last year
- Low rank adaptation (LoRA) for Candle.☆169Apr 18, 2025Updated 10 months ago
- Rust+OpenCL+AVX2 implementation of LLaMA inference code☆554Feb 12, 2024Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Fast and safe color management system in Rust☆40Feb 28, 2026Updated last week
- A machine learning library for Rust.☆335Aug 19, 2024Updated last year
- allms: One Rust Library to rule them aLLMs☆108Feb 21, 2026Updated 2 weeks ago
- A Deep Learning and preprocessing framework in Rust with support for CPU and GPU.☆133Oct 3, 2023Updated 2 years ago
- your friendly pangenome graph genotyper☆10Feb 6, 2023Updated 3 years ago
- Half-precision floating point types f16 and bf16 for Rust.☆274Feb 11, 2026Updated 3 weeks ago
- Multi-platform high-performance compute language extension for Rust.☆2,031Updated this week
- A Rust implementation of OpenAI's Whisper model using the burn framework☆345May 6, 2024Updated last year
- STL loader for bevy, based on stl_io☆33Jan 16, 2026Updated last month
- ☆14Feb 16, 2021Updated 5 years ago
- A low-resource native app for sharing space with co-workers and friends.☆15Feb 20, 2025Updated last year
- ☆12May 20, 2024Updated last year
- Reading rosbag files in pure Rust☆23May 25, 2022Updated 3 years ago
- Port of MapBox's earcut triangulation code to Rust language☆47May 29, 2025Updated 9 months ago
- ☆33Jan 24, 2026Updated last month
- A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web☆1,745Jul 21, 2024Updated last year
- Conway-Hart Polyhedron Notation in Rust☆80Jan 10, 2026Updated 2 months ago
- A fast llama2 decoder in pure Rust.☆1,062Nov 30, 2023Updated 2 years ago
- Prove multi-opens of EIP-4844 KZG blobs☆16Jun 15, 2023Updated 2 years ago
- Ollama api implementation for spin☆11Feb 16, 2024Updated 2 years ago
- Bindings to TinyGL, a Small, Free and Fast Subset of OpenGL☆13Dec 1, 2022Updated 3 years ago
- rust sdk for zkWasm☆11Feb 11, 2026Updated 3 weeks ago
- Lightweight piece tokenization library☆12Apr 15, 2024Updated last year
- Non-standard integer types like u7, u9, u10, u63, i7, i9 etc.☆11Nov 11, 2024Updated last year
- various contracts to set off or receive cross-chain calls☆10Apr 27, 2022Updated 3 years ago
- Is it easy to draw a line?☆14Sep 25, 2020Updated 5 years ago
- Rust generators implemented through async/await syntax☆12Sep 29, 2023Updated 2 years ago
- Prover Manager☆25Updated this week
- Rust bindings for the Wolfram|Alpha web API☆11Nov 21, 2017Updated 8 years ago
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- Andromeda revm execution service☆32Jul 25, 2024Updated last year