chelsea0x3b/llama-dfdx

LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!

☆113

Alternatives and similar repositories for llama-dfdx

Users who are interested in llama-dfdx are comparing it to the libraries listed below.

chelsea0x3b / dfdx
Deep learning in Rust, with shape checked tensors and neural networks
☆1,921 · Jul 23, 2024 · Updated 2 years ago
KerfuffleV2 / ggml-sys-bleedingedge
Bleeding edge low level Rust binding for GGML
☆18 · Jun 26, 2024 · Updated 2 years ago
chelsea0x3b / synthesis
A rust implementation of AlphaZero algorithm
☆60 · Feb 3, 2023 · Updated 3 years ago
ariasanovsky / spindle
☆12 · Jul 25, 2024 · Updated last year
vadixidav / toil
An n-dimensional array library that uses wgpu to run compute shaders on all wgpu backends (and multiple at once)
☆31 · May 25, 2020 · Updated 6 years ago
pangenome / gfagino
your friendly pangenome graph genotyper
☆10 · Feb 6, 2023 · Updated 3 years ago
Gadersd / llama2-burn
Llama2 LLM ported to Rust burn
☆280 · Apr 16, 2024 · Updated 2 years ago
Noeda / rllama
Rust+OpenCL+AVX2 implementation of LLaMA inference code
☆554 · Feb 12, 2024 · Updated 2 years ago
EricLBuehler / candle-lora
Low rank adaptation (LoRA) for Candle.
☆176 · Apr 18, 2025 · Updated last year
tiberiusferreira / tensor_compute
Exploration of GPU computing using WebGPU
☆27 · Jan 4, 2021 · Updated 5 years ago
webonnx / wonnx
A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web
☆1,755 · Jul 21, 2024 · Updated 2 years ago
tiby312 / broccoli-project
☆81 · Jul 11, 2023 · Updated 3 years ago
ehsanmok / dlpackrs
DLPack safe Rust binding
☆15 · Sep 20, 2022 · Updated 3 years ago
neovide / vide
A straightforward wgpu renderer for 2d interfaces
☆22 · Oct 12, 2024 · Updated last year
charles-r-earp / autograph
A machine learning library for Rust.
☆334 · Aug 19, 2024 · Updated last year
frewsxcv / earcutr
Port of MapBox's earcut triangulation code to Rust language
☆49 · May 5, 2026 · Updated 2 months ago
Modulus-Labs / leela_vs_world
A repository for the Leela VS the World project for On-Chain machine learning
☆19 · Sep 26, 2024 · Updated last year
ddprrt / shuttle-qdrant-openai
Sample repo for Shuttle Qdrant OpenAI
☆15 · Nov 21, 2023 · Updated 2 years ago
Wolvereness / remit
Rust generators implemented through async/await syntax
☆12 · Sep 29, 2023 · Updated 2 years ago
code-sam / graphblas_sparse_linear_algebra
Rust wrapper for SuiteSparse:GraphBLAS
☆15 · May 28, 2026 · Updated last month
greekfetacheese / zeus
A truly seedless and decentralized self-custodial Ethereum wallet
☆18 · Updated this week
Gadersd / whisper-burn
A Rust implementation of OpenAI's Whisper model using the burn framework
☆356 · May 6, 2024 · Updated 2 years ago
wkentaro / jqk
Render a JSON with jq patterns.
☆20 · Aug 20, 2023 · Updated 2 years ago
rust-cuda / cuda-sys
Rust binding to CUDA APIs
☆118 · Mar 29, 2025 · Updated last year
LaurentMazare / glim
☆19 · Dec 31, 2025 · Updated 6 months ago
EricLBuehler / candle_graphs
Graph model execution API for Candle
☆18 · Jul 27, 2025 · Updated 11 months ago
srush / llama2.rs
A fast llama2 decoder in pure Rust.
☆1,063 · Nov 30, 2023 · Updated 2 years ago
JonathanWoollett-Light / rust-ad
An automatic differentiation library for both forward and reverse auto-diff via code transformation written in Rust.
☆16 · Jan 21, 2022 · Updated 4 years ago
tracel-ai / cubecl
Multi-platform high-performance compute language extension for Rust.
☆2,285 · Updated this week
RoastVeg / tinygl-rs
Bindings to TinyGL, a Small, Free and Fast Subset of OpenGL
☆13 · Dec 1, 2022 · Updated 3 years ago
ashsystems / coqui-rs
Rust bindings to the https://github.com/coqui-ai TTS library
☆21 · Mar 27, 2023 · Updated 3 years ago
vv9k / AIrtifex
Generative AI web UI and server
☆22 · May 23, 2023 · Updated 3 years ago
tracel-ai / models
Models and examples built with Burn
☆375 · Apr 28, 2026 · Updated 2 months ago
akhilkedia / TranformersGetStable
[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"
☆11 · Jul 19, 2024 · Updated 2 years ago
jpastuszek / odbc-iter
Rust high level database access library based on 'odbc' crate that uses native ODBC drivers to access a variety of databases
☆12 · Aug 27, 2024 · Updated last year
lyronctk / kzg-blob
Prove multi-opens of EIP-4844 KZG blobs
☆16 · Jun 15, 2023 · Updated 3 years ago
blas-lapack-rs / blas-src
BLAS source of choice
☆44 · Oct 6, 2025 · Updated 9 months ago
Rjected / erc3770
ERC-3770
☆18 · Jul 9, 2024 · Updated 2 years ago
sonos / tract
Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference
☆3,008 · Updated this week