google-ai-edge / model-explorer
A modern model graph visualizer and debugger
☆1,058 Updated this week
Related projects
Alternatives and complementary repositories for model-explorer
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,679 Updated this week
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a… ☆1,199 Updated this week
- Tile primitives for speedy kernels ☆1,658 Updated this week
- Puzzles for learning Triton ☆1,135 Updated this week
- llama3.np is a pure NumPy implementation for Llama 3 model. ☆975 Updated 5 months ago
- NanoGPT (124M) quality in 7.8 8xH100-minutes ☆1,033 Updated this week
- LLM Analytics ☆615 Updated last month
- Open weights language model from Google DeepMind, based on Griffin. ☆607 Updated 4 months ago
- An interactive HTML pretty-printer for machine learning research in IPython notebooks. ☆337 Updated 2 weeks ago
- Flash Attention in ~100 lines of CUDA (forward pass only) ☆626 Updated 7 months ago
- Best practices & guides on how to write distributed pytorch training code ☆286 Updated 2 weeks ago
- PyTorch native quantization and sparsity for training and inference ☆1,585 Updated this week
- nanoGPT style version of Llama 3.1 ☆1,246 Updated 3 months ago
- Manipulating Python Programs ☆603 Updated last week
- UNet diffusion model in pure CUDA ☆584 Updated 4 months ago
- ☆641 Updated this week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆483 Updated 3 weeks ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆687 Updated 2 months ago
- Schedule-Free Optimization in PyTorch ☆1,898 Updated 2 weeks ago
- A simple, performant and scalable Jax LLM! ☆1,532 Updated this week
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆916 Updated 2 weeks ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,155 Updated 2 weeks ago
- Felafax is building AI infra for non-NVIDIA GPUs ☆509 Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling" ☆803 Updated 3 months ago
- Official implementation of Half-Quadratic Quantization (HQQ) ☆701 Updated last week
- ☆448 Updated 7 months ago
- ☆2,746 Updated 2 months ago
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different… ☆270 Updated 3 weeks ago
- What would you do with 1000 H100s... ☆903 Updated 10 months ago
- On-device AI across mobile, embedded and edge for PyTorch ☆2,191 Updated this week