An Extensible Deep Learning Library
☆2,363May 16, 2026Updated 3 weeks ago
Alternatives and similar repositories for axlearn
Users that are interested in axlearn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple, performant and scalable Jax LLM!☆2,304Jun 2, 2026Updated last week
- MLX: An array framework for Apple silicon☆26,600Updated this week
- CoreNet: A library for training deep neural networks☆6,998Oct 9, 2025Updated 8 months ago
- Examples in the MLX framework☆8,676Apr 6, 2026Updated 2 months ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆554Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Flax is a neural network library for JAX that is designed for flexibility.☆7,227Updated this week
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,890Jun 22, 2025Updated 11 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆709Jan 26, 2026Updated 4 months ago
- A PyTorch native platform for training generative AI models☆5,416Updated this week
- ☆8,682Oct 9, 2024Updated last year
- PyTorch native post-training library☆5,768Updated this week
- Minimalistic large language model 3D-parallelism training☆2,711May 26, 2026Updated 2 weeks ago
- Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)☆2,720Apr 25, 2023Updated 3 years ago
- ☆224Jan 23, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Development repository for the Triton language and compiler☆19,380Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆35,741Updated this week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,516Aug 13, 2024Updated last year
- A simple library for scaling up JAX programs☆147Nov 4, 2025Updated 7 months ago
- Efficient framework-agnostic data loading☆476Oct 1, 2025Updated 8 months ago
- jax-triton contains integrations between JAX and OpenAI Triton☆461Jun 1, 2026Updated last week
- Orbax provides common checkpointing and persistence utilities for JAX users☆518Updated this week
- Optax is a gradient processing and optimization library for JAX.☆2,273Jun 1, 2026Updated last week
- This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.☆1,419Aug 4, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- JAX-based neural network library☆3,236Jun 2, 2026Updated last week
- Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.☆5,276May 27, 2026Updated last week
- Fast and memory-efficient exact attention☆24,037Updated this week
- PyTorch native quantization and sparsity for training and inference☆2,847Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆81,909Updated this week
- Tensor library for machine learning☆14,770May 29, 2026Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆3,077May 26, 2026Updated 2 weeks ago
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆4,320Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models.☆28,886Updated this week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Tools for merging pretrained large language models.☆7,108May 6, 2026Updated last month
- 4M: Massively Multimodal Masked Modeling☆1,798Jun 2, 2025Updated last year
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,933Updated this week
- ☆197May 4, 2026Updated last month
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.☆3,457May 19, 2025Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,230Jul 11, 2024Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,343May 19, 2026Updated 3 weeks ago