MatX-inc / seqaxLinks

seqax = sequence modeling + JAX

☆165

Alternatives and similar repositories for seqax

Users that are interested in seqax are comparing it to the libraries listed below

Sorting:

google-deepmind / nanodo
☆274Updated last year
young-geng / scalax
A simple library for scaling up JAX programs
☆140Updated 9 months ago
jax-ml / jax-llm-examples
☆137Updated last week
young-geng / mintext
Minimal but scalable implementation of large language models in JAX
☆35Updated last week
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆123Updated 7 months ago
Sea-Snell / JAX_llama
Inference code for LLaMA models in JAX
☆118Updated last year
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆140Updated last year
stanford-crfm / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆627Updated this week
modula-systems / modula
🧱 Modula software package
☆210Updated this week
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆205Updated last year
ayaka14732 / jax-smi
JAX Synergistic Memory Inspector
☆177Updated last year
yixiaoer / tpux
A set of Python scripts that makes your experience on TPU better
☆55Updated last year
imbue-ai / carbs
Cost aware hyperparameter tuning algorithm
☆166Updated last year
ethansmith2000 / fsdp_optimizers
supporting pytorch FSDP for optimizers
☆84Updated 7 months ago
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆219Updated last year
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆221Updated 2 weeks ago
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆79Updated last year
jenkspt / gpt-jax
Jax/Flax rewrite of Karpathy's nanoGPT
☆59Updated 2 years ago
AllanYangZhou / midGPT
Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.
☆24Updated 10 months ago
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆149Updated last month
proger / accelerated-scan
Accelerated First Order Parallel Associative Scan
☆184Updated 11 months ago
stanford-crfm / haliax
Named Tensors for Legible Deep Learning in JAX
☆194Updated this week
graphcore-research / unit-scaling
A library for unit scaling in PyTorch
☆128Updated 3 weeks ago
nshepperd / flash_attn_jax
JAX bindings for Flash Attention v2
☆90Updated this week
google / praxis
☆187Updated 2 weeks ago
google / tunix
A JAX-native LLM Post-Training Library
☆76Updated this week
m-a-n-i-f-e-s-t / power-attention
Attention Kernels for Symmetric Power Transformers
☆102Updated last week
google-research / jaxpruner
☆232Updated 5 months ago
jax-ml / jax-triton
jax-triton contains integrations between JAX and OpenAI Triton
☆411Updated last month
evanatyourservice / psgd_jax
Implementation of PSGD optimizer in JAX
☆34Updated 7 months ago