ruslangrimov / mnist-minimal-model
Trying to find the minimal model that can achieve 99% accuracy on the MNIST dataset
☆25Updated 6 years ago
Alternatives and similar repositories for mnist-minimal-model
Users that are interested in mnist-minimal-model are comparing it to the libraries listed below
- Tiny ASIC implementation for "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits" matrix multiplication unit☆152Updated last year
- FastFeedForward Networks☆20Updated last year
- General Matrix Multiplication using NVIDIA Tensor Cores☆17Updated 4 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆44Updated 10 months ago
- E2E AutoML Model Compression Package☆46Updated 2 months ago
- Personal solutions to the Triton Puzzles☆18Updated 10 months ago
- bfloat16 dtype for numpy☆19Updated last year
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆108Updated 7 months ago
- Experiment of using Tangent to autodiff triton☆79Updated last year
- Samples of good AI generated CUDA kernels☆73Updated last week
- The Riallto Open Source Project from AMD☆80Updated last month
- High-Performance SGEMM on CUDA devices☆94Updated 4 months ago
- ☆21Updated last year
- LLM4HWDesign Starting Toolkit☆17Updated 8 months ago
- ☆48Updated 10 months ago
- Unit Scaling demo and experimentation code☆16Updated last year
- An AI accelerator implementation with Xilinx FPGA☆46Updated 4 months ago
- Fast training of unitary deep network layers from low-rank updates☆28Updated 2 years ago
- C++ and Python libraries for neural networks.☆15Updated 3 weeks ago
- Generic floating-point types in Python☆12Updated 2 months ago
- Torch2Chip (MLSys, 2024)☆51Updated 2 months ago
- A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.☆14Updated 2 years ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆126Updated 6 months ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆25Updated 10 months ago
- ☆23Updated 5 months ago
- ☆91Updated last year
- ☆18Updated last year
- ☆52Updated 9 months ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)"☆19Updated last year
- train with kittens!☆57Updated 7 months ago