jxbz / agdLinks

Automatic gradient descent

☆217

Alternatives and similar repositories for agd

Users that are interested in agd are comparing it to the libraries listed below

Sorting:

google-deepmind / tf2jax
☆120Updated 2 weeks ago
HenryNdubuaku / nanodl
A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.
☆300Updated last year
google-deepmind / synjax
☆251Updated 7 months ago
facebookresearch / torchdim
Named tensors with first-class dimensions for PyTorch
☆332Updated 2 years ago
lucidrains / PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆190Updated 3 years ago
cgarciae / nnx
Neural Networks for JAX
☆84Updated last year
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆145Updated last year
KindXiaoming / BIMT
Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.
☆175Updated 2 years ago
DarshanDeshpande / jax-models
Unofficial JAX implementations of deep learning research papers
☆161Updated 3 years ago
srush / raspy
An interactive exploration of Transformer programming.
☆271Updated 2 years ago
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆210Updated 2 years ago
google-research / jaxpruner
☆234Updated last year
lixilinx / psgd_torch
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…
☆190Updated last month
kingoflolz / swarm-jax
Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes
☆242Updated 2 years ago
hundredblocks / large-model-parallelism
Functional local implementations of main model parallelism approaches
☆95Updated 2 years ago
r-three / git-theta
git extension for {collaborative, communal, continual} model development
☆217Updated last year
RobertRiachi / nanoPALM
☆144Updated 2 years ago
mlcommons / algorithmic-efficiency
MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…
☆406Updated last week
awf / functional-transformer
A pure-functional implementation of a machine learning transformer model in Python/JAX
☆181Updated 9 months ago
ludwigwinkler / JaxLightning
Running Jax in PyTorch Lightning
☆119Updated last year
google / autobound
AutoBound automatically computes upper and lower bounds on functions.
☆364Updated 3 months ago
HazyResearch / H3
Language Modeling with the H3 State Space Model
☆522Updated 2 years ago
IDSIA / modern-srwm
Official repository for the paper "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep R…
☆175Updated 8 months ago
google-deepmind / compressed_vision
☆130Updated 2 years ago
google-deepmind / nanodo
☆291Updated last year
cgarciae / einop
☆60Updated 3 years ago
bhoov / hamux
Hierarchical Associative Memory User Experience
☆106Updated 3 weeks ago
cgarciae / ciclo
A functional training loops library for JAX
☆88Updated last year
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆216Updated 2 years ago
ayaka14732 / jax-smi
JAX Synergistic Memory Inspector
☆184Updated last year