ed-aisys / edin-mls-25-springLinks
An open-source ML system course
☆28Updated 9 months ago
Alternatives and similar repositories for edin-mls-25-spring
Users that are interested in edin-mls-25-spring are comparing it to the libraries listed below
Sorting:
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆75Updated 6 months ago
- ☆169Updated last week
- ☆13Updated last year
- ring-attention experiments☆160Updated last year
- Learn CUDA with PyTorch☆124Updated 3 weeks ago
- Samples of good AI generated CUDA kernels☆94Updated 6 months ago
- Technical report of Kimina-Prover Preview.☆348Updated 5 months ago
- ☆324Updated 3 months ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆85Updated 3 months ago
- ☆24Updated 6 months ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Updated last year
- Memory optimized Mixture of Experts☆69Updated 4 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆130Updated last year
- H-Net Dynamic Hierarchical Architecture☆80Updated 3 months ago
- Experimental GPU language with meta-programming☆24Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆64Updated this week
- NSA Triton Kernels written with GPT5 and Opus 4.1☆69Updated 4 months ago
- ☆66Updated 9 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆121Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆180Updated 5 months ago
- 👷 Build compute kernels☆193Updated this week
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆52Updated 9 months ago
- Physics of Language Models, Part 4☆270Updated last week
- The evaluation framework for training-free sparse attention in LLMs☆106Updated 2 months ago
- Fluid Language Model Benchmarking☆22Updated 3 months ago
- 📄Small Batch Size Training for Language Models☆68Updated 2 months ago
- PyTorch centric eager mode debugger☆48Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 8 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆71Updated this week
- Evaluation of LLMs on latest math competitions☆204Updated 2 months ago