young-geng / tpu_pod_commanderLinks
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
☆20Updated 11 months ago
Alternatives and similar repositories for tpu_pod_commander
Users that are interested in tpu_pod_commander are comparing it to the libraries listed below
Sorting:
- Minimal but scalable implementation of large language models in JAX☆34Updated 7 months ago
- If it quacks like a tensor...☆58Updated 6 months ago
- A simple library for scaling up JAX programs☆137Updated 7 months ago
- LoRA for arbitrary JAX models and functions☆136Updated last year
- General Modules for JAX☆66Updated 2 months ago
- ☆29Updated 6 months ago
- Building blocks for productive research☆55Updated 4 months ago
- Machine Learning eXperiment Utilities☆46Updated 11 months ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- ☆32Updated last year
- Automatically take good care of your preemptible TPUs☆36Updated 2 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆29Updated last year
- GPT implementation in Flax☆18Updated 3 years ago
- ☆34Updated 2 years ago
- ☆18Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated 7 months ago
- flexible meta-learning in jax☆14Updated last year
- ☆19Updated 3 weeks ago
- Scaling scaling laws with board games.☆49Updated last year
- Implementation of PSGD optimizer in JAX☆33Updated 5 months ago
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆14Updated 10 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆50Updated 6 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated last week
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆28Updated last year
- Learn online intrinsic rewards from LLM feedback☆37Updated 5 months ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆127Updated last year
- ☆47Updated last year
- ☆17Updated 9 months ago