ruslangrimov / mnist-minimal-model

Trying to find the minimal model that can achieve 99% accuracy on the MNIST dataset

☆25

Alternatives and similar repositories for mnist-minimal-model

Users who are interested in mnist-minimal-model are comparing it to the repositories listed below

GreenWaves-Technologies / bfloat16
bfloat16 dtype for numpy
☆19 Updated last year
rejunity / tiny-asic-1_58bit-matrix-mul
Tiny ASIC implementation for "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits" matrix multiplication unit
☆158 Updated last year
graphcore-research / out-of-the-box-fp8-training
Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.
☆46 Updated last year
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆79 Updated last year
sap-ient-ai / FFF
FastFeedForward Networks
☆20 Updated last year
groq / mlagility
Machine Learning Agility (MLAgility) benchmark and benchmarking tools
☆39 Updated 2 months ago
FrancescoSaverioZuppichini / pytorch-2.0-benchmark
Benchmarking PyTorch 2.0 different models
☆21 Updated 2 years ago
lernapparat / torchhacks
Hacks for PyTorch
☆19 Updated 2 years ago
iml130 / nncg
NNCG: A Neural Network Code Generator
☆35 Updated 11 months ago
ScalingIntelligence / good-kernels
Samples of good AI generated CUDA kernels
☆84 Updated last month
spcl / daceml
A Data-Centric Compiler for Machine Learning
☆84 Updated last year
HazyResearch / butterfly
Butterfly matrix multiplication in PyTorch
☆172 Updated last year
iree-org / iree-jax
☆52 Updated 11 months ago
Ying1123 / awesome-neural-symbolic
A list of awesome neural symbolic papers.
☆47 Updated 2 years ago
epoch-research / Compute-Trends
Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".
☆40 Updated 3 years ago
graphcore-research / gfloat
Generic floating-point types in Python
☆13 Updated 3 months ago
jax-ml / ml_dtypes
A stand-alone implementation of several NumPy dtype extensions used in machine learning.
☆280 Updated last week
mila-iqia / milabench
Repository of machine learning benchmarks
☆39 Updated last week
CentML / DeepView.Profile
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
☆64 Updated 5 months ago
facebookresearch / projUNN
Fast training of unitary deep network layers from low-rank updates
☆28 Updated 2 years ago
srivatsankrishnan / oss-arch-gym
Open source version of ArchGym project.
☆117 Updated 3 months ago
gau-nernst / quantized-training
Explore training for quantized models
☆20 Updated this week
satabios / sconce
E2E AutoML Model Compression Package
☆46 Updated 4 months ago
DeMoriarty / custom_matmul_kernels
Customized matrix multiplication kernels
☆56 Updated 3 years ago
RaulMurillo / deep-pensieve
A Deep Learning Framework for the Posit Number System
☆29 Updated 11 months ago
mit-han-lab / neurips-micronet
[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion
☆40 Updated 4 years ago
softmax1 / Flash-Attention-Softmax-N
CUDA and Triton implementations of Flash Attention with SoftmaxN.
☆70 Updated last year
Jokeren / triton-samples
☆28 Updated 6 months ago
alvarobartt / safejax
Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`
☆45 Updated last year
srush / drop7
☆18 Updated last year