ed-aisys / edin-mls-25-springLinks
An open-source ML system course
☆28Updated 9 months ago
Alternatives and similar repositories for edin-mls-25-spring
Users that are interested in edin-mls-25-spring are comparing it to the libraries listed below
Sorting:
- ring-attention experiments☆161Updated last year
- Technical report of Kimina-Prover Preview.☆349Updated 6 months ago
- The evaluation framework for training-free sparse attention in LLMs☆108Updated 3 months ago
- Evaluation of LLMs on latest math competitions☆212Updated 3 weeks ago
- mHC kernels implemented in CUDA☆196Updated last week
- Write a fast kernel and run it on Discord. See how you compare against the best!☆66Updated this week
- Physics of Language Models, Part 4☆303Updated last week
- Lean formalizations of IMO problem statements☆27Updated 2 months ago
- ☆178Updated last month
- Our solution to Putnam 2025.☆36Updated this week
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 9 months ago
- ☆395Updated 3 weeks ago
- Neural theorem proving tutorial, version II☆40Updated last year
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Updated 2 months ago
- ☆224Updated 9 months ago
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆86Updated 4 months ago
- LLMs + Lean, on your laptop or in the cloud☆199Updated 3 months ago
- train with kittens!☆63Updated last year
- H-Net Dynamic Hierarchical Architecture☆80Updated 4 months ago
- Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding☆130Updated this week
- ☆13Updated last year
- ☆42Updated last year
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆131Updated last year
- ☆75Updated last year
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆90Updated 5 months ago
- Memory optimized Mixture of Experts☆72Updated 5 months ago
- ☆141Updated 4 months ago
- Samples of good AI generated CUDA kernels☆99Updated 7 months ago
- Large language models designed for formal theorem proving through tool-integrated reasoning.☆31Updated 5 months ago
- Experimental GPU language with meta-programming☆24Updated last year