srush / anynp
Proof-of-concept of global switching between numpy/jax/pytorch in a library.
☆18 · Updated last year
Alternatives and similar repositories for anynp
Users who are interested in anynp are comparing it to the libraries listed below.
- Experiment of using Tangent to autodiff triton ☆80 · Updated last year
- ☆21 · Updated last year
- JAX implementation of the Mistral 7b v0.2 model ☆35 · Updated last year
- nanoGPT-like codebase for LLM training ☆108 · Updated 4 months ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo… ☆72 · Updated 3 months ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" ☆59 · Updated 3 years ago
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 · Updated last year
- A simple library for scaling up JAX programs ☆143 · Updated 11 months ago
- Minimal yet performant LLM examples in pure JAX ☆184 · Updated 3 weeks ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522) ☆63 · Updated 4 years ago
- ☆91 · Updated last year
- ☆115 · Updated last month
- This is a port of Mistral-7B model in JAX ☆32 · Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs ☆60 · Updated this week
- Einsum-like high-level array sharding API for JAX ☆35 · Updated last year
- Named Tensors for Legible Deep Learning in JAX ☆207 · Updated this week
- Running Jax in PyTorch Lightning ☆112 · Updated 9 months ago
- Neural Networks for JAX ☆84 · Updated last year
- ☆13 · Updated 4 months ago
- ☆38 · Updated last year
- Minimal but scalable implementation of large language models in JAX ☆35 · Updated last month
- seqax = sequence modeling + JAX ☆167 · Updated 2 months ago
- LoRA for arbitrary JAX models and functions ☆141 · Updated last year
- Graph neural networks in JAX. ☆68 · Updated last year
- Train very large language models in Jax. ☆209 · Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆88 · Updated last year
- Resources from the EleutherAI Math Reading Group ☆54 · Updated 7 months ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆183 · Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆164 · Updated 3 months ago
- TorchFix - a linter for PyTorch-using code with autofix support ☆148 · Updated last month