huggingface / kernel-builderLinks
π· Build compute kernels
β68Updated this week
Alternatives and similar repositories for kernel-builder
Users that are interested in kernel-builder are comparing it to the libraries listed below
Sorting:
- Load compute kernels from the Hubβ191Updated this week
- Lightweight toolkit package to train and fine-tune 1.58bit Language modelsβ78Updated last month
- β47Updated 4 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!β46Updated this week
- Collection of autoregressive model implementationβ85Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ24Updated last year
- β39Updated 2 years ago
- Make triton easierβ46Updated last year
- β47Updated 9 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.β36Updated last year
- Hugging Face Jobsβ17Updated this week
- vLLM adapter for a TGIS-compatible gRPC server.β32Updated this week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)β101Updated 3 months ago
- NanoGPT (124M) quality in 2.67B tokensβ28Updated last month
- Learn CUDA with PyTorchβ27Updated this week
- Simple high-throughput inference libraryβ119Updated last month
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS β¦β59Updated 8 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IPβ94Updated last month
- Storing long contexts in tiny caches with self-studyβ67Updated this week
- implement llava using candleβ15Updated last year
- Lego for GRPOβ28Updated 3 weeks ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" givenβ¦β14Updated last year
- β56Updated 3 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLoreβ28Updated 9 months ago
- MatFormer repoβ31Updated 6 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β33Updated last month
- β11Updated 4 months ago
- [WIP] Better (FP8) attention for Hopperβ30Updated 4 months ago
- β65Updated 2 weeks ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)β67Updated 3 months ago