ruslangrimov / mnist-minimal-model
Trying to find out what is the minimal model that can achieve 99% accuracy on MNIST dataset
☆23Updated 6 years ago
Related projects: ⓘ
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆103Updated 5 months ago
- bfloat16 dtype for numpy☆16Updated 11 months ago
- Simple and fast low-bit matmul kernels in CUDA☆48Updated this week
- FastFeedForward Networks☆18Updated 9 months ago
- The Riallto Open Source Project from AMD☆63Updated 3 weeks ago
- ☆63Updated 8 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆34Updated 2 months ago
- A list of awesome neural symbolic papers.☆37Updated 2 years ago
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year
- ☆40Updated 2 months ago
- Experiment of using Tangent to autodiff triton☆66Updated 7 months ago
- ☆19Updated 5 months ago
- Machine-Learning Accelerator System Exploration Tools☆115Updated this week
- Unit Scaling demo and experimentation code☆16Updated 6 months ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆23Updated last month
- ☆26Updated last year
- A Deep Learning Framework for the Posit Number System☆23Updated last month
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆73Updated 3 weeks ago
- ☆18Updated 5 months ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆41Updated 3 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆94Updated 2 weeks ago
- LLM4HWDesign Starting Toolkit☆15Updated this week
- Fast Hadamard transform in CUDA, with a PyTorch interface☆87Updated 3 months ago
- Benchmarking PyTorch 2.0 different models☆20Updated last year
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆49Updated 3 weeks ago
- Adaptive floating-point based numerical format for resilient deep learning☆14Updated 2 years ago
- ☆21Updated last year
- ☆13Updated 2 months ago
- Attention in SRAM on Tenstorrent Grayskull☆22Updated 2 months ago
- Token Omission Via Attention☆118Updated 7 months ago