google-deepmind / hierarchical_perceiver
☆27 Updated 3 months ago
Alternatives and similar repositories for hierarchical_perceiver
Users that are interested in hierarchical_perceiver are comparing it to the libraries listed below
- FID computation in Jax/Flax. ☆28 Updated last year
- ☆32 Updated 9 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆54 Updated 9 months ago
- ☆115 Updated 2 months ago
- ☆52 Updated last year
- ☆34 Updated last year
- Easy Hypernetworks in Pytorch and Jax ☆104 Updated 2 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆80 Updated 3 years ago
- PyTorch Package For Quasimetric Learning ☆42 Updated 10 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆87 Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆101 Updated 2 years ago
- Explorations into the recently proposed Taylor Series Linear Attention ☆100 Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto ☆56 Updated last year
- Beyond Straight-Through ☆102 Updated 2 years ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods. ☆20 Updated last year
- ☆51 Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆107 Updated 11 months ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR) ☆21 Updated 10 months ago
- Deep Networks Grok All the Time and Here is Why ☆37 Updated last year
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️ ☆55 Updated 2 years ago
- A simple, performant and scalable JAX-based world modeling codebase ☆70 Updated last week
- ☆53 Updated last year
- ☆57 Updated 11 months ago
- Building blocks for productive research ☆59 Updated last month
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ) ☆34 Updated last year
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons ☆99 Updated 3 weeks ago
- NEVIS'22: Benchmarking the next generation of never-ending learners ☆102 Updated 2 years ago
- ☆130 Updated 2 years ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making" ☆27 Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆131 Updated last year