google-deepmind / hierarchical_perceiver
☆27Updated 2 months ago
Alternatives and similar repositories for hierarchical_perceiver
Users who are interested in hierarchical_perceiver are comparing it to the libraries listed below.
- ☆121Updated 6 months ago
- ☆35Updated last year
- ☆53Updated last year
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆56Updated last year
- FID computation in Jax/Flax.☆29Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆114Updated last year
- ☆52Updated last year
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆29Updated last year
- NEVIS'22: Benchmarking the next generation of never-ending learners☆102Updated 2 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆40Updated last year
- ☆53Updated last year
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Updated 4 years ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆57Updated last year
- Deep Networks Grok All the Time and Here is Why☆38Updated last year
- ☆56Updated last year
- ☆34Updated last year
- Easy Hypernetworks in Pytorch and Jax☆106Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆21Updated 2 months ago
- Implementation of Hierarchical Transformer Memory (HTM) for Pytorch☆76Updated 4 years ago
- Official code for the paper "Attention as a Hypernetwork"☆46Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆37Updated 2 years ago
- Implementation of the HIERarchical imagination On Structured State Space Sequence Models (HIEROS) paper☆20Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆92Updated last year
- PyTorch Package For Quasimetric Learning☆44Updated last year
- Building blocks for productive research☆64Updated 4 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- Beyond Straight-Through☆105Updated 2 years ago
- [ICLR 2025] Implementation of "FACTS: A Factored State-Space Framework For World Modelling"☆28Updated 6 months ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Updated 2 years ago