zanussbaum / surfgradLinks

webgpu autograd library

☆29

Alternatives and similar repositories for surfgrad

Users that are interested in surfgrad are comparing it to the libraries listed below

Sorting:

N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆70Updated 5 months ago
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆126Updated 3 months ago
ScalingIntelligence / good-kernels
Samples of good AI generated CUDA kernels
☆85Updated last month
xjdr-alt / muzero_sketch
☆38Updated 11 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
huggingface / kernel-builder
👷 Build compute kernels
☆78Updated this week
KempnerInstitute / traveling-waves-integrate
Repository to create traveling waves integrate special information through time
☆53Updated 4 months ago
okarthikb / state-space-models
☆27Updated last year
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆189Updated 7 months ago
guidance-ai / llgtrt
TensorRT-LLM server with Structured Outputs (JSON) built with Rust
☆56Updated 3 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 4 months ago
PrimeIntellect-ai / pi-quant
SIMD quantization kernels
☆76Updated last week
PrimeIntellect-ai / pccl
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆97Updated last week
leo-du / llama2.rs
Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust
☆38Updated last year
LAION-AI / AIW
Alice in Wonderland code base for experiments and raw experiments data
☆131Updated last month
Zyphra / Zamba2
PyTorch implementation of models from the Zamba2 series.
☆184Updated 6 months ago
LaurentMazare / mamba.rs
☆130Updated last year
ggerganov / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆30Updated last year
QuixiAI / grokadamw
☆134Updated 11 months ago
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66Updated last year
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆94Updated 8 months ago
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62Updated 8 months ago
tomsanbear / bitnet-rs
Implementing the BitNet model in Rust
☆38Updated last year
SonicCodes / lucid-v1
realtime latent world model inference demo
☆46Updated 8 months ago
ericyuegu / hal
Training AI for Super Smash Bros. Melee
☆28Updated 3 months ago
blackhole89 / autopen
Editor with LLM generation tree exploration
☆72Updated 5 months ago
Oxen-AI / GRPO-With-Cargo-Feedback
This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback
☆100Updated 4 months ago
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆130Updated last month
LucasPrietoAl / grokking-at-the-edge-of-numerical-stability
☆98Updated 6 months ago
deepsilicon / Sila
☆89Updated 9 months ago