AI-Hypercomputer / tpu-recipesLinks
☆64Updated this week
Alternatives and similar repositories for tpu-recipes
Users that are interested in tpu-recipes are comparing it to the libraries listed below
Sorting:
- ☆148Updated last month
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆79Updated last week
- Google TPU optimizations for transformers models☆131Updated last week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆155Updated this week
- ☆190Updated last week
- MoE training for Me and You and maybe other people☆239Updated last week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 6 months ago
- A set of Python scripts that makes your experience on TPU better☆54Updated 3 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Updated 2 years ago
- ☆91Updated last year
- Load compute kernels from the Hub☆352Updated last week
- An implementation of the Llama architecture, to instruct and delight☆21Updated 6 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!☆64Updated this week
- Various transformers for FSDP research☆38Updated 3 years ago
- 👷 Build compute kernels☆195Updated this week
- Package of Pathways-on-Cloud utilities☆21Updated this week
- Experiment of using Tangent to autodiff triton☆81Updated last year
- ☆178Updated last year
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆396Updated 6 months ago
- LM engine is a library for pretraining/finetuning LLMs☆102Updated this week
- ☆340Updated 2 weeks ago
- ☆21Updated 9 months ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆118Updated 3 months ago
- ☆47Updated last year
- ☆121Updated last year
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆459Updated 2 weeks ago
- ML/DL Math and Method notes☆65Updated 2 years ago
- seqax = sequence modeling + JAX☆169Updated 5 months ago
- ☆63Updated 3 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Updated 5 months ago