ruslangrimov / mnist-minimal-model
An attempt to find the smallest model that can achieve 99% accuracy on the MNIST dataset
☆24 Updated 6 years ago
Alternatives and similar repositories for mnist-minimal-model:
Users who are interested in mnist-minimal-model are comparing it to the repositories listed below
- High-Performance SGEMM on CUDA devices ☆76 Updated last month
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆43 Updated 7 months ago
- Experiment of using Tangent to autodiff triton ☆75 Updated last year
- Unit Scaling demo and experimentation code ☆16 Updated 11 months ago
- Repository for CPU Kernel Generation for LLM Inference ☆25 Updated last year
- Personal solutions to the Triton Puzzles ☆17 Updated 7 months ago
- FlexAttention w/ FlashAttention3 Support ☆26 Updated 4 months ago
- Tiny ASIC implementation for "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits" matrix multiplication unit ☆118 Updated 10 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools ☆38 Updated 2 months ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to… ☆23 Updated this week
- E2E AutoML Model Compression Package ☆46 Updated 3 weeks ago
- FastFeedForward Networks ☆19 Updated last year
- ☆51 Updated 6 months ago
- Explore training for quantized models ☆15 Updated last month
- GPU benchmark ☆52 Updated 3 weeks ago
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization ☆102 Updated 4 months ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆44 Updated last year
- ☆21 Updated last week
- RWKV model implementation ☆37 Updated last year
- ☆46 Updated last year
- Hacks for PyTorch ☆18 Updated last year
- ☆12 Updated 3 years ago
- Framework to reduce autotune overhead to zero for well known deployments. ☆61 Updated 3 weeks ago
- extensible collectives library in triton ☆83 Updated 4 months ago
- Implementation of Hyena Hierarchy in JAX ☆10 Updated last year
- ☆107 Updated last month
- Attention in SRAM on Tenstorrent Grayskull ☆31 Updated 7 months ago
- ☆84 Updated last month
- Learning about CUDA by writing PTX code. ☆35 Updated 11 months ago