AI-Hypercomputer / maxtext
A simple, performant and scalable Jax LLM!
☆1,532Updated this week
Related projects ⓘ
Alternatives and complementary repositories for maxtext
- A native PyTorch Library for large model training☆2,635Updated this week
- Training LLMs with QLoRA + FSDP☆1,420Updated 2 weeks ago
- ☆892Updated last month
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆687Updated 3 months ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆457Updated last week
- Reaching LLaMA2 Performance with 0.1M Dollars☆961Updated 3 months ago
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a…☆1,199Updated this week
- Tile primitives for speedy kernels☆1,661Updated this week
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,170Updated this week
- Puzzles for learning Triton☆1,138Updated this week
- Open weights language model from Google DeepMind, based on Griffin.☆607Updated 4 months ago
- PyTorch native quantization and sparsity for training and inference☆1,592Updated this week
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,680Updated this week
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,530Updated 4 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,673Updated last month
- NanoGPT (124M) in 5 minutes☆1,269Updated this week
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆1,436Updated 3 weeks ago
- ☆2,506Updated 6 months ago
- An Extensible Deep Learning Library☆1,876Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆804Updated 3 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,338Updated 7 months ago
- ☆448Updated 7 months ago
- PyTorch native finetuning library☆4,346Updated this week
- A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.☆2,678Updated this week
- What would you do with 1000 H100s...☆910Updated 10 months ago
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch☆1,694Updated last week
- ☆224Updated 4 months ago
- Minimalistic large language model 3D-parallelism training☆1,265Updated this week
- ☆1,968Updated this week