ruslangrimov / mnist-minimal-model
Trying to find out what is the minimal model that can achieve 99% accuracy on MNIST dataset
☆23Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for mnist-minimal-model
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆111Updated 7 months ago
- ☆44Updated 4 months ago
- bfloat16 dtype for numpy☆17Updated last year
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆87Updated last month
- ☆21Updated 11 months ago
- Experiment of using Tangent to autodiff triton☆72Updated 10 months ago
- LLM4HWDesign Starting Toolkit☆17Updated last month
- ☆43Updated last year
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆35Updated 4 months ago
- ☆26Updated last year
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆18Updated last year
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆43Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆38Updated 10 months ago
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆45Updated last year
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year
- A tracing JIT compiler for PyTorch☆12Updated 2 years ago
- First Open-Source Industry-Specific Model for Semiconductors☆103Updated this week
- ☆74Updated 10 months ago
- Unit Scaling demo and experimentation code☆16Updated 8 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆55Updated this week
- FastFeedForward Networks☆18Updated 11 months ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- ☆72Updated last month
- Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.☆9Updated last year
- Attention in SRAM on Tenstorrent Grayskull☆29Updated 4 months ago
- The Riallto Open Source Project from AMD☆69Updated last week
- Jax like function transformation engine but micro, microjax☆26Updated 3 weeks ago
- ☆35Updated 3 weeks ago
- Clean RL implementation using MLX☆27Updated 8 months ago