StanfordMIMI / LieRE
[ICML 2025] We introduce Lie group Relative position Encodings (LieRE), which go beyond RoPE in supporting n-dimensional inputs.
☆28 Updated 5 months ago
Alternatives and similar repositories for LieRE
Users who are interested in LieRE are comparing it to the libraries listed below.
- [ICML 2025] Implementation of Spatial Reasoning with Denoising Models ☆85 Updated 5 months ago
- [NeurIPS 2025 Oral] Exploring Diffusion Transformer Designs via Grafting ☆69 Updated last week
- ☆71 Updated last year
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆98 Updated last year
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network ☆110 Updated 5 months ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch ☆55 Updated last year
- [NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models ☆30 Updated last year
- Diffusion Models as Data Mining Tools ☆56 Updated 8 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n… ☆43 Updated last year
- This is the official code release for [LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors](https://arxiv… ☆44 Updated last year
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction ☆37 Updated last year
- Code for Principal Masked Autoencoders ☆30 Updated 9 months ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation" ☆183 Updated last year
- ☆72 Updated 5 months ago
- ☆66 Updated 5 months ago
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation" ☆44 Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch ☆103 Updated 2 years ago
- ☆40 Updated last year
- ☆53 Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples. ☆26 Updated 2 years ago
- The official repo of continuous speculative decoding ☆31 Updated 9 months ago
- More dimensions = More fun ☆26 Updated last year
- Implementation of a multimodal diffusion transformer in Pytorch ☆107 Updated last year
- Official repository for the article Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models (https://arxiv.org/… ☆35 Updated 4 months ago
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics? ☆25 Updated 5 months ago
- TIPS (ICLR'25): Text-Image Pretraining with Spatial Awareness ☆112 Updated 9 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling. ☆34 Updated last year
- High order Moment Models ☆42 Updated 2 months ago
- ☆67 Updated last month
- [ICML 2024] Compositional Image Decomposition with Diffusion Models ☆52 Updated last year