apple / axlearnLinks
An Extensible Deep Learning Library
☆2,057Updated last week
Alternatives and similar repositories for axlearn
Users that are interested in axlearn are comparing it to the libraries listed below
Sorting:
- A simple, performant and scalable Jax LLM!☆1,734Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,197Updated this week
- A PyTorch native platform for training generative AI models☆3,838Updated this week
- PyTorch native quantization and sparsity for training and inference☆2,064Updated this week
- Efficient framework-agnostic data loading☆422Updated this week
- Tile primitives for speedy kernels☆2,399Updated this week
- Minimalistic large language model 3D-parallelism training☆1,888Updated last week
- 4M: Massively Multimodal Masked Modeling☆1,721Updated last week
- PyTorch native post-training library☆5,217Updated this week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,300Updated last month
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,780Updated last month
- Inference Llama 2 in one file of pure 🔥☆2,108Updated last year
- Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and …☆1,350Updated this week
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,170Updated 7 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,960Updated last month
- FlashInfer: Kernel Library for LLM Serving☆3,044Updated this week
- ☆2,952Updated 8 months ago
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…☆2,435Updated last week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,505Updated 2 months ago
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,555Updated last week
- Data and tools for generating and inspecting OLMo pre-training data.☆1,220Updated last week
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆2,916Updated this week
- Puzzles for learning Triton☆1,658Updated 6 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆639Updated last week
- DataComp for Language Models☆1,300Updated 2 months ago
- Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)☆2,634Updated 2 years ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,586Updated last week
- CoreNet: A library for training deep neural networks☆7,013Updated 3 weeks ago
- LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. …☆819Updated 5 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,445Updated this week