cgarciae / dyn_plot
☆13Updated 2 months ago
Related projects: ⓘ
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆31Updated 3 months ago
- ☆23Updated this week
- ☆56Updated 2 years ago
- Fine-grained, dynamic control of neural network topology in JAX.☆21Updated last year
- [TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative…☆17Updated last year
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆45Updated last month
- This is a port of Mistral-7B model in JAX☆29Updated 2 months ago
- ☆18Updated 5 months ago
- JAX/Flax implementation of the Hyena Hierarchy☆29Updated last year
- ☆40Updated 2 months ago
- Automatically take good care of your preemptible TPUs☆28Updated last year
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated last year
- ☆20Updated this week
- ☆11Updated this week
- Einsum-like high-level array sharding API for JAX☆31Updated 2 months ago
- FID computation in Jax/Flax.☆23Updated 2 months ago
- ☆28Updated last week
- ☆25Updated 5 months ago
- Personal solutions to the Triton Puzzles☆11Updated 2 months ago
- ☆27Updated this week
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆29Updated 3 weeks ago
- The 2D discrete wavelet transform for JAX☆36Updated last year
- ☆17Updated 4 months ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆41Updated 3 months ago
- Minimum Description Length probing for neural network representations☆15Updated 11 months ago
- Wraps PyTorch code in a JIT-compatible way for JAX. Supports automatically defining gradients for reverse-mode AutoDiff.☆34Updated last month
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆13Updated 3 weeks ago
- ☆13Updated this week
- ☆42Updated 3 months ago
- Open source code for EigenGame.☆28Updated last year