huggingface / kernel-builder
π· Build compute kernels
β17Updated this week
Alternatives and similar repositories for kernel-builder:
Users that are interested in kernel-builder are comparing it to the libraries listed below
- Load compute kernels from the Hubβ99Updated this week
- Make triton easierβ47Updated 9 months ago
- Train, tune, and infer Bamba modelβ86Updated 2 months ago
- β46Updated 8 months ago
- research impl of Native Sparse Attention (2502.11089)β54Updated last month
- Experiment of using Tangent to autodiff tritonβ78Updated last year
- β76Updated 8 months ago
- β14Updated 8 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)β32Updated last month
- This repo is based on https://github.com/jiaweizzhao/GaLoreβ26Updated 6 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.β17Updated last week
- β12Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ24Updated last year
- Utilities for Training Very Large Modelsβ58Updated 6 months ago
- Here we will test various linear attention designs.β60Updated 11 months ago
- β16Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276β27Updated last month
- Lightweight tools for quick and easy LLM demo'sβ26Updated 6 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" givenβ¦β14Updated last year
- Training hybrid models for dummies.β20Updated 2 months ago
- A small rust-based data loaderβ24Updated 3 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)β60Updated this week
- Mixed precision training from scratch with Tensors and CUDAβ21Updated 10 months ago
- Collection of autoregressive model implementationβ83Updated last month
- Learn CUDA with PyTorchβ19Updated last month
- train with kittens!β54Updated 5 months ago
- β12Updated last year
- Because it's there.β15Updated 6 months ago
- QuIP quantizationβ52Updated last year
- Gpu benchmarkβ57Updated 2 months ago