PyTorch routines for (Ker)nel (Mac)hines
☆10 Oct 10, 2025 Updated 4 months ago
Alternatives and similar repositories for kermac
Users that are interested in kermac are comparing it to the libraries listed below
- ☆18 Nov 11, 2025 Updated 3 months ago
- ☆60 Apr 12, 2025 Updated 10 months ago
- A Top-Down Profiler for GPU Applications ☆22 Feb 29, 2024 Updated 2 years ago
- ☆44 Updated this week
- Automatic differentiation for Triton Kernels ☆29 Aug 12, 2025 Updated 6 months ago
- EigenPro Iteration in PyTorch ☆19 Jan 9, 2024 Updated 2 years ago
- ☆23 Jan 25, 2024 Updated 2 years ago
- Benchmarking Optimizers for LLM Pretraining ☆52 Dec 30, 2025 Updated 2 months ago
- A bunch of kernels that might make stuff slower 😉 ☆75 Feb 18, 2026 Updated last week
- ☆28 Jan 17, 2025 Updated last year
- ☆53 Updated this week
- Well-annotated word2vec source code with detailed bilingual comments ☆10 Oct 3, 2021 Updated 4 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 Apr 15, 2024 Updated last year
- ☆27 Dec 3, 2025 Updated 2 months ago
- Python Based Domain Coloring ☆34 Dec 27, 2025 Updated 2 months ago
- ☆12 May 14, 2025 Updated 9 months ago
- Slimebound character mod for Slay the Spire ☆14 Jun 30, 2020 Updated 5 years ago
- Vector Approximate Message Passing (VAMP) ☆11 May 14, 2023 Updated 2 years ago
- Exploring the minimal architecture required for coherent English language generation. ☆12 Mar 5, 2025 Updated 11 months ago
- carbon.now.sh python module ☆11 Oct 10, 2021 Updated 4 years ago
- CUTLASS and CuTe Examples ☆132 Nov 30, 2025 Updated 3 months ago
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning". ☆45 Mar 12, 2022 Updated 3 years ago
- Make triton easier ☆50 Jun 12, 2024 Updated last year
- Stackfish is an open-source LLM-powered pipeline designed to automatically solve competitive programming problems. ☆53 Dec 14, 2024 Updated last year
- ☆14 Feb 11, 2026 Updated 2 weeks ago
- GeekGameBoard (GGB) is a small framework for building board and card games. It's based on Apple's Core Animation framework. ☆21 Mar 14, 2013 Updated 12 years ago
- Accelerating LLM inference with techniques like speculative decoding, quantization, and kernel fusion, focusing on implementing state-of-… ☆11 Jul 1, 2025 Updated 7 months ago
- A project designed to build and render a full Minecraft crafting tree. ☆10 Aug 10, 2021 Updated 4 years ago
- ☆14 Mar 8, 2025 Updated 11 months ago
- Simple MoE - Day 17 of 365 Days of Repos ☆16 Jan 17, 2025 Updated last year
- diffusers with search engine ☆11 Jan 13, 2026 Updated last month
- Learning materials for Stanford Compiler course: CS143 ☆18 Oct 19, 2021 Updated 4 years ago
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 Aug 9, 2025 Updated 6 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs" ☆10 Dec 30, 2024 Updated last year
- CPU and GPU tutorial examples ☆13 Apr 4, 2025 Updated 10 months ago
- Parallel Self-Adjusting Computation ☆15 Jul 5, 2021 Updated 4 years ago
- Tutorials for MATH 4432 Statistical Machine Learning, HKUST, Fall 2022 ☆11 Sep 17, 2024 Updated last year
- Automated bottleneck detection and solution orchestration ☆19 Updated this week
- Code for "What really matters in matrix-whitening optimizers?" ☆21 Oct 31, 2025 Updated 4 months ago