modula-systems / modulaLinks
π§± Modula software package
β202Updated 3 months ago
Alternatives and similar repositories for modula
Users that are interested in modula are comparing it to the libraries listed below
Sorting:
- β273Updated 11 months ago
- LoRA for arbitrary JAX models and functionsβ140Updated last year
- A simple library for scaling up JAX programsβ139Updated 8 months ago
- β195Updated 7 months ago
- supporting pytorch FSDP for optimizersβ82Updated 7 months ago
- Efficient optimizersβ232Updated last week
- seqax = sequence modeling + JAXβ163Updated 3 weeks ago
- β132Updated last week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.β141Updated 2 weeks ago
- Named Tensors for Legible Deep Learning in JAXβ185Updated this week
- β110Updated last month
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditionβ¦β179Updated last month
- A MAD laboratory to improve AI architecture designs π§ͺβ123Updated 6 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 secondsβ256Updated 4 months ago
- Implementation of PSGD optimizer in JAXβ33Updated 6 months ago
- Minimal but scalable implementation of large language models in JAXβ35Updated last week
- Universal Tensor Operations in Einstein-Inspired Notation for Python.β385Updated 3 months ago
- β79Updated last year
- JAX Synergistic Memory Inspectorβ175Updated 11 months ago
- Run PyTorch in JAX. π€β253Updated last week
- β229Updated 5 months ago
- Understand and test language model architectures on synthetic tasks.β219Updated last month
- Accelerated First Order Parallel Associative Scanβ182Updated 10 months ago
- Cost aware hyperparameter tuning algorithmβ162Updated last year
- A library for unit scaling in PyTorchβ125Updated 7 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorchβ92Updated 3 months ago
- If it quacks like a tensor...β58Updated 7 months ago
- Pytorch-like dataloaders for JAX.β90Updated last month
- WIPβ93Updated 10 months ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)β94Updated 7 months ago