ruslangrimov / mnist-minimal-modelLinks
Trying to find out what is the minimal model that can achieve 99% accuracy on MNIST dataset
☆26Updated 6 years ago
Alternatives and similar repositories for mnist-minimal-model
Users that are interested in mnist-minimal-model are comparing it to the libraries listed below
Sorting:
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆158Updated last year
- bfloat16 dtype for numpy☆19Updated last year
- The Riallto Open Source Project from AMD☆82Updated 4 months ago
- A Deep Learning Framework for the Posit Number System☆29Updated last year
- Experiment of using Tangent to autodiff triton☆80Updated last year
- Open source version of ArchGym project.☆117Updated 3 months ago
- ☆52Updated last year
- E2E AutoML Model Compression Package☆45Updated 5 months ago
- General Matrix Multiplication using NVIDIA Tensor Cores☆18Updated 6 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last week
- ☆12Updated 4 years ago
- LLM4HWDesign Starting Toolkit☆17Updated 10 months ago
- A Data-Centric Compiler for Machine Learning☆84Updated last year
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆286Updated 2 weeks ago
- Butterfly matrix multiplication in PyTorch☆174Updated last year
- High-Performance SGEMM on CUDA devices☆98Updated 6 months ago
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆41Updated 3 years ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆46Updated last year
- A high-efficiency system-on-chip for floating-point compute workloads.☆39Updated 6 months ago
- Samples of good AI generated CUDA kernels☆88Updated 2 months ago
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆211Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆50Updated this week
- FastFeedForward Networks☆20Updated last year
- Machine-Learning Accelerator System Exploration Tools☆173Updated 2 months ago
- A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.☆14Updated 2 years ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆44Updated 4 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆119Updated last week
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Updated 6 months ago
- Memory Optimizations for Deep Learning (ICML 2023)☆102Updated last year