zanussbaum / surfgrad
webgpu autograd library
☆21Updated 2 months ago
Alternatives and similar repositories for surfgrad:
Users that are interested in surfgrad are comparing it to the libraries listed below
- ☆25Updated 2 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated last year
- Latent Large Language Models☆17Updated 5 months ago
- LLM training in simple, raw C/Metal Shading Language☆47Updated 9 months ago
- ☆86Updated 4 months ago
- PyTorch implementation of models from the Zamba2 series.☆176Updated 3 weeks ago
- WebGPU LLM inference tuned by hand☆149Updated last year
- ☆27Updated 7 months ago
- look how they massacred my boy☆63Updated 4 months ago
- ☆38Updated 6 months ago
- ☆34Updated 7 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 7 months ago
- RWKV-7: Surpassing GPT☆79Updated 3 months ago
- ☆123Updated 6 months ago
- Training code for Sparse Autoencoders on Embedding models☆35Updated 2 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆57Updated 3 weeks ago
- Implementation of mamba with rust☆77Updated 11 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆37Updated last year
- Chat Markup Language conversation library☆55Updated last year
- alternative way to calculating self attention☆18Updated 8 months ago
- A fork of llama3.c used to do some R&D on inferencing☆18Updated 2 months ago
- trying to make WebGPU a bit easier to use☆16Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆53Updated last year
- Experimental compiler for deep learning models☆26Updated last week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆130Updated this week
- Rust Implementation of micrograd☆51Updated 7 months ago