zanussbaum / surfgrad
WebGPU autograd library
☆24, updated 2 weeks ago
Alternatives and similar repositories for surfgrad
Users who are interested in surfgrad are comparing it to the libraries listed below.
- A tree-based prefix cache library that allows rapid creation of looms: hierarchical branching pathways of LLM generations. ☆69, updated 3 months ago
- Samples of good AI generated CUDA kernels ☆73, updated last week
- look how they massacred my boy ☆63, updated 7 months ago
- PyTorch implementation of models from the Zamba2 series. ☆182, updated 4 months ago
- Training Models Daily ☆17, updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers ☆66, updated last month
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66, updated last year
- ☆18, updated 2 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆76, updated last month
- ☆26, updated 5 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆100, updated 3 months ago
- Rust Implementation of micrograd ☆51, updated 11 months ago
- Repository to create traveling waves that integrate spatial information through time ☆52, updated 3 months ago
- ☆27, updated last month
- ☆38, updated 10 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust ☆55, updated last month
- Editor with LLM generation tree exploration ☆67, updated 3 months ago
- DeMo: Decoupled Momentum Optimization ☆188, updated 6 months ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback ☆94, updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53, updated 4 months ago
- ANE accelerated embedding models! ☆17, updated 5 months ago
- LLM training in simple, raw C/Metal Shading Language ☆54, updated last year
- Lego for GRPO ☆28, updated last week
- Implementing the BitNet model in Rust ☆37, updated last year
- ☆180, updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆64, updated 7 months ago
- Modded vLLM to run pipeline parallelism over public networks ☆36, updated 2 weeks ago
- RWKV-7: Surpassing GPT ☆88, updated 6 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust ☆38, updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models. ☆74, updated last week