ed-aisys / edin-mls-25-springLinks
An open-source ML system course
☆28Updated 7 months ago
Alternatives and similar repositories for edin-mls-25-spring
Users that are interested in edin-mls-25-spring are comparing it to the libraries listed below
Sorting:
- ring-attention experiments☆155Updated last year
 - Write a fast kernel and run it on Discord. See how you compare against the best!☆58Updated 3 weeks ago
 - PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 7 months ago
 - Learning about CUDA by writing PTX code.☆146Updated last year
 - Memory optimized Mixture of Experts☆69Updated 3 months ago
 - Learn CUDA with PyTorch☆95Updated last month
 - Mixed precision training from scratch with Tensors and CUDA☆28Updated last year
 - The evaluation framework for training-free sparse attention in LLMs☆102Updated 3 weeks ago
 - Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆147Updated 2 years ago
 - in this repository, i'm going to implement increasingly complex llm inference optimizations☆70Updated 5 months ago
 - Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆130Updated 11 months ago
 - Experimental GPU language with meta-programming☆23Updated last year
 - NSA Triton Kernels written with GPT5 and Opus 4.1☆64Updated 2 months ago
 - High-Performance SGEMM on CUDA devices☆107Updated 9 months ago
 - 👷 Build compute kernels☆163Updated last week
 - How to ensure correctness and ship LLM generated kernels in PyTorch☆111Updated this week
 - TritonParse: A Compiler Tracer, Visualizer, and Reproducer for Triton Kernels☆167Updated this week
 - The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆109Updated 3 weeks ago
 - LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆71Updated 9 months ago
 - ☆300Updated last month
 - Collection of kernels written in Triton language☆159Updated 6 months ago
 - Cataloging released Triton kernels.☆264Updated last month
 - A set of Python scripts that makes your experience on TPU better☆54Updated last month
 - ☆93Updated 4 months ago
 - train with kittens!☆63Updated last year
 - ☆225Updated 2 weeks ago
 - PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆137Updated last month
 - Fluid Language Model Benchmarking☆19Updated last month
 - ☆24Updated 5 months ago
 - ☆65Updated 3 months ago