google-deepmind / hierarchical_perceiverLinks
☆27Updated last month
Alternatives and similar repositories for hierarchical_perceiver
Users that are interested in hierarchical_perceiver are comparing it to the libraries listed below
Sorting:
- ☆104Updated 2 weeks ago
- ☆31Updated 7 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆51Updated 6 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆56Updated last year
- ☆32Updated last year
- FID computation in Jax/Flax.☆27Updated 11 months ago
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- ☆51Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- Easy Hypernetworks in Pytorch and Jax☆101Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆20Updated last year
- Official code for the paper "Attention as a Hypernetwork"☆39Updated last year
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆54Updated 2 years ago
- ☆51Updated last year
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆124Updated last year
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆30Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆100Updated 2 years ago
- ☆53Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Updated 7 months ago
- ☆53Updated 8 months ago
- Building blocks for productive research☆58Updated 4 months ago
- Beyond Straight-Through☆97Updated 2 years ago
- NF-Layers for constructing neural functionals.☆86Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆129Updated last year
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆46Updated 2 years ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Updated 5 months ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Updated 3 years ago
- A State-Space Model with Rational Transfer Function Representation.☆78Updated last year
- PyTorch Package For Quasimetric Learning☆42Updated 7 months ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆29Updated 4 years ago