zanussbaum / surfgradLinks
webgpu autograd library
☆32Updated 4 months ago
Alternatives and similar repositories for surfgrad
Users that are interested in surfgrad are comparing it to the libraries listed below
Sorting:
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆124Updated 5 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆72Updated 7 months ago
- ☆35Updated 6 months ago
- SIMD quantization kernels☆87Updated last month
- look how they massacred my boy☆63Updated 11 months ago
- Repository to create traveling waves integrate special information through time☆55Updated 2 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆59Updated 5 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated last year
- Samples of good AI generated CUDA kernels☆91Updated 4 months ago
- Editor with LLM generation tree exploration☆77Updated 7 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆83Updated last month
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 11 months ago
- ☆40Updated last year
- ☆89Updated last year
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆52Updated last month
- DeMo: Decoupled Momentum Optimization☆193Updated 10 months ago
- Implementation of mamba with rust☆88Updated last year
- LLM training in simple, raw C/Metal Shading Language☆58Updated last year
- Pivotal Token Search☆126Updated 2 months ago
- trying to make WebGPU a bit easier to use☆17Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 8 months ago
- ☆26Updated 9 months ago
- Lego for GRPO☆29Updated 4 months ago
- lossily compress representation vectors using product quantization☆59Updated 5 months ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆107Updated 7 months ago
- PyTorch implementation of models from the Zamba2 series.☆185Updated 8 months ago
- ☆62Updated 2 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- Training Models Daily☆16Updated last year
- ☆43Updated last week