modula-systems / modula
𧱠Modula software package
β132Updated this week
Alternatives and similar repositories for modula:
Users that are interested in modula are comparing it to the libraries listed below
- supporting pytorch FSDP for optimizersβ75Updated last month
- β146Updated last month
- A MAD laboratory to improve AI architecture designs π§ͺβ102Updated last month
- LoRA for arbitrary JAX models and functionsβ135Updated 10 months ago
- Efficient optimizersβ144Updated this week
- A simple library for scaling up JAX programsβ129Updated 2 months ago
- β75Updated 6 months ago
- Experiment of using Tangent to autodiff tritonβ74Updated 11 months ago
- β201Updated 6 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT trainingβ121Updated 9 months ago
- WIPβ92Updated 5 months ago
- β53Updated 11 months ago
- Understand and test language model architectures on synthetic tasks.β175Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.β90Updated last month
- Named Tensors for Legible Deep Learning in JAXβ157Updated last week
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditionβ¦β146Updated last month
- Implementation of PSGD optimizer in JAXβ26Updated 2 weeks ago
- β58Updated 2 years ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)β182Updated 7 months ago
- If it quacks like a tensor...β55Updated 2 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resourcesβ119Updated this week
- Run PyTorch in JAX. π€β214Updated last week
- 94% on CIFAR-10 in 2.6 seconds π¨ 96% in 27 secondsβ195Updated last month
- Accelerated First Order Parallel Associative Scanβ169Updated 4 months ago
- β50Updated 3 months ago
- A library for unit scaling in PyTorchβ118Updated last month
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overheadβ210Updated last week
- JAX Synergistic Memory Inspectorβ164Updated 6 months ago
- β40Updated last month
- This is a port of Mistral-7B model in JAXβ30Updated 6 months ago