ed-aisys / edin-mls-25-spring
An open-source ML system course
☆26Updated 2 weeks ago
Alternatives and similar repositories for edin-mls-25-spring:
Users that are interested in edin-mls-25-spring are comparing it to the libraries listed below
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆62Updated last week
- The official repository for the paper Multilingual Mathematical Autoformalization☆34Updated 10 months ago
- ring-attention experiments☆129Updated 5 months ago
- Harmonic Datasets☆37Updated 8 months ago
- LLMs + Lean, on your laptop or in the cloud☆140Updated 5 months ago
- ☆60Updated 11 months ago
- ☆37Updated 6 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆72Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆28Updated 2 weeks ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"☆59Updated 5 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!☆35Updated this week
- Unofficial Implementation of Evolutionary Model Merging☆37Updated last year
- A minimal implementation of vllm.☆37Updated 8 months ago
- ☆30Updated 2 months ago
- Tutorial on neural theorem proving☆168Updated last year
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆59Updated 2 months ago
- 🔥 A minimal training framework for scaling FLA models☆92Updated last week
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆51Updated 11 months ago
- ☆74Updated 7 months ago
- ☆13Updated 9 months ago
- ☆83Updated 2 months ago
- NeurIPS 2024 tutorial on LLM Inference☆39Updated 3 months ago
- ☆65Updated last month
- [COLM 2024] A Survey on Deep Learning for Theorem Proving☆173Updated last month
- Benchmark for undergraduate-level formal mathematics☆103Updated 5 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆103Updated 4 months ago
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆59Updated 2 months ago
- Flash Hyperbolic Attention in ~[...] lines of CUDA☆21Updated 11 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆43Updated 8 months ago
- [ICLR 2025] Palu: Compressing KV-Cache with Low-Rank Projection☆94Updated last month