ruslangrimov / mnist-minimal-modelLinks
Trying to find out what is the minimal model that can achieve 99% accuracy on MNIST dataset
☆28Updated 7 years ago
Alternatives and similar repositories for mnist-minimal-model
Users that are interested in mnist-minimal-model are comparing it to the libraries listed below
Sorting:
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆171Updated last year
- The Riallto Open Source Project from AMD☆83Updated 8 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 4 months ago
- Open source version of ArchGym project.☆123Updated 8 months ago
- A Data-Centric Compiler for Machine Learning☆85Updated last week
- A Deep Learning Framework for the Posit Number System☆31Updated last year
- General Matrix Multiplication using NVIDIA Tensor Cores☆27Updated 10 months ago
- Butterfly matrix multiplication in PyTorch☆177Updated 2 years ago
- High-Performance SGEMM on CUDA devices☆113Updated 11 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆46Updated last year
- Experiment of using Tangent to autodiff triton☆81Updated last year
- Attention in SRAM on Tenstorrent Grayskull☆39Updated last year
- Custom PTX Instruction Benchmark☆136Updated 9 months ago
- ☆27Updated 3 weeks ago
- ☆81Updated 2 weeks ago
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆43Updated 3 years ago
- Samples of good AI generated CUDA kernels☆94Updated 6 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆320Updated this week
- bfloat16 dtype for numpy☆20Updated 2 years ago
- ☆28Updated 11 months ago
- train with kittens!☆63Updated last year
- Autocomp: AI Code Optimizer for Tensor Accelerators☆56Updated this week
- Write a fast kernel and run it on Discord. See how you compare against the best!☆64Updated this week
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆111Updated last year
- ☆54Updated last year
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆215Updated 2 years ago
- python package of rocm-smi-lib☆24Updated last week
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆88Updated 2 years ago
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Updated 11 months ago
- Torch Frontend for IREE☆25Updated 2 years ago