modula-systems / modulaLinks

🧱 Modula software package

☆307

Alternatives and similar repositories for modula

Users that are interested in modula are comparing it to the libraries listed below

Sorting:

google-deepmind / nanodo
☆285Updated last year
HomebrewML / HeavyBall
Efficient optimizers
☆276Updated 3 weeks ago
jax-ml / jax-llm-examples
Minimal yet performant LLM examples in pure JAX
☆204Updated 2 months ago
MatX-inc / seqax
seqax = sequence modeling + JAX
☆168Updated 4 months ago
ethansmith2000 / fsdp_optimizers
supporting pytorch FSDP for optimizers
☆84Updated 11 months ago
marin-community / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆685Updated last week
microsoft / dion
Dion optimizer algorithm
☆388Updated 2 weeks ago
nikhilvyas / SOAP
☆224Updated last year
KellerJordan / cifar10-airbench
CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds
☆327Updated 2 weeks ago
young-geng / scalax
A simple library for scaling up JAX programs
☆144Updated 3 weeks ago
marin-community / haliax
Named Tensors for Legible Deep Learning in JAX
☆212Updated 3 weeks ago
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆174Updated 5 months ago
fferflo / einx
Universal Notation for Tensor Operations in Python.
☆449Updated 7 months ago
mlcommons / algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…
☆401Updated this week
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆143Updated last year
lixilinx / psgd_torch
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…
☆188Updated last month
proger / accelerated-scan
Accelerated First Order Parallel Associative Scan
☆192Updated last year
imbue-ai / carbs
Cost aware hyperparameter tuning algorithm
☆175Updated last year
graphcore-research / unit-scaling
A library for unit scaling in PyTorch
☆132Updated 4 months ago
kvfrans / splus
☆119Updated 5 months ago
clement-bonnet / lpn
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆106Updated last week
iliao2345 / CompressARC
☆201Updated 3 months ago
evanatyourservice / kron_torch
An implementation of PSGD Kron second-order optimizer for PyTorch
☆97Updated 4 months ago
cloneofsimo / min-fsdp
☆91Updated last year
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆135Updated 11 months ago
wilson-labs / cola
Compositional Linear Algebra
☆498Updated 4 months ago
young-geng / mintext
Minimal but scalable implementation of large language models in JAX
☆35Updated this week
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆240Updated 2 months ago
evanatyourservice / psgd_jax
Implementation of PSGD optimizer in JAX
☆35Updated 11 months ago
m-a-n-i-f-e-s-t / power-attention
Attention Kernels for Symmetric Power Transformers
☆128Updated 2 months ago