srush / anynpLinks

Proof-of-concept of global switching between numpy/jax/pytorch in a library.

☆18

Alternatives and similar repositories for anynp

Users that are interested in anynp are comparing it to the libraries listed below

Sorting:

srush / Tensor-Puzzles-Penzai
☆21Updated last year
yixiaoer / mistral-v0.2-jax
JAX implementation of the Mistral 7b v0.2 model
☆35Updated last year
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆80Updated last year
epfml / llm-baselines
nanoGPT-like codebase for LLM training
☆110Updated 2 weeks ago
lixilinx / psgd_torch
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…
☆188Updated last month
edwardjhu / TP4
Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)
☆63Updated 4 years ago
modula-systems / modula
🧱 Modula software package
☆303Updated 3 months ago
ludwigwinkler / JaxLightning
Running Jax in PyTorch Lightning
☆114Updated 11 months ago
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆34Updated last year
cgarciae / einop
☆60Updated 3 years ago
shikaiqiu / compute-better-spent
☆61Updated last year
aks2203 / easy-to-hard
Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"
☆59Updated 3 years ago
AakashKumarNain / mistral_jax
This is a port of Mistral-7B model in JAX
☆32Updated last year
joey00072 / microjax
Jax like function transformation engine but micro, microjax
☆33Updated last year
JesseFarebro / flax-mup
Maximal Update Parametrization (μP) with Flax & Optax.
☆16Updated last year
drisspg / transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
☆62Updated last month
dylandoblar / noether-networks
Meta-learning inductive biases in the form of useful conserved quantities.
☆38Updated 3 years ago
cloneofsimo / min-fsdp
☆91Updated last year
google / drjax
☆15Updated last month
marin-community / haliax
Named Tensors for Legible Deep Learning in JAX
☆211Updated 2 weeks ago
young-geng / scalax
A simple library for scaling up JAX programs
☆144Updated 2 weeks ago
microsoft / mutransformers
some common Huggingface transformers in maximal update parametrization (µP)
☆86Updated 3 years ago
google-deepmind / dks
Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…
☆74Updated 4 months ago
vvvm23 / mamba-jax
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆89Updated last year
cgarciae / ciclo
A functional training loops library for JAX
☆88Updated last year
cgarciae / nnx
Neural Networks for JAX
☆84Updated last year
hundredblocks / large-model-parallelism
Functional local implementations of main model parallelism approaches
☆96Updated 2 years ago
srush / do-we-need-attention
☆166Updated 2 years ago
srush / mamba-primer
☆38Updated last year
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆87Updated last year