google-ai-edge / model-explorerLinks
A modern model graph visualizer and debugger
☆1,349Updated last week
Alternatives and similar repositories for model-explorer
Users that are interested in model-explorer are comparing it to the libraries listed below
Sorting:
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…☆1,431Updated this week
- llama3.np is a pure NumPy implementation for Llama 3 model.☆992Updated 7 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,830Updated 6 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,401Updated 8 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆656Updated 6 months ago
- PyTorch native quantization and sparsity for training and inference☆2,576Updated this week
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,152Updated this week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆587Updated 4 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆829Updated 4 months ago
- Efficient framework-agnostic data loading☆452Updated 2 months ago
- Tile primitives for speedy kernels☆3,008Updated last week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆930Updated last month
- Best practices & guides on how to write distributed pytorch training code☆552Updated 2 months ago
- A simple, performant and scalable Jax LLM!☆2,047Updated this week
- UNet diffusion model in pure CUDA☆656Updated last year
- a small code base for training large models☆315Updated 7 months ago
- A pytorch quantization backend for optimum☆1,019Updated last month
- Official implementation of Half-Quadratic Quantization (HQQ)☆902Updated this week
- An Extensible Deep Learning Library☆2,303Updated last week
- Speed up model training by fixing data loading.☆566Updated last week
- Training LLMs with QLoRA + FSDP☆1,534Updated last year
- GPU programming related news and material links☆1,874Updated 3 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆686Updated last year
- Felafax is building AI infra for non-NVIDIA GPUs☆568Updated 10 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,923Updated 3 months ago
- For optimization algorithm research and development.☆553Updated this week
- Distributed Training Over-The-Internet☆972Updated 2 months ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆459Updated 2 weeks ago
- Minimalistic large language model 3D-parallelism training☆2,365Updated last week
- Manipulating Python Programs☆704Updated 2 weeks ago