zanussbaum / surfgradLinks
webgpu autograd library
☆26Updated last month
Alternatives and similar repositories for surfgrad
Users that are interested in surfgrad are comparing it to the libraries listed below
Sorting:
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆70Updated 4 months ago
- look how they massacred my boy☆63Updated 8 months ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 2 months ago
- ☆38Updated 11 months ago
- Samples of good AI generated CUDA kernels☆83Updated 3 weeks ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- SIMD quantization kernels☆72Updated this week
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- Lego for GRPO☆28Updated last month
- Repository to create traveling waves integrate special information through time☆53Updated 3 months ago
- ☆26Updated 6 months ago
- ☆27Updated 11 months ago
- tiny code to access tenstorrent blackhole☆52Updated last month
- smolLM with Entropix sampler on pytorch☆150Updated 7 months ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆95Updated 3 months ago
- Implementing the BitNet model in Rust☆37Updated last year
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆95Updated last month
- LLM training in simple, raw C/Metal Shading Language☆55Updated last year
- Cerule - A Tiny Mighty Vision Model☆66Updated 9 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 9 months ago
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…☆33Updated 2 months ago
- Modded vLLM to run pipeline parallelism over public networks☆37Updated last month
- PyTorch implementation of models from the Zamba2 series.☆182Updated 5 months ago
- Experimental GPU language with meta-programming☆23Updated 9 months ago
- jsgrad is a dependency-free ML library in Typescript for model inference and training with support to WebGPU and other runtimes.☆54Updated 2 months ago
- ANE accelerated embedding models!☆18Updated 6 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆66Updated 2 months ago
- Latent Large Language Models☆18Updated 10 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆38Updated last year