UCSC-VLAA / CRATE-alpha
This repository includes the official implementation of our paper "Scaling White-Box Transformers for Vision"
☆47 Updated last year
Alternatives and similar repositories for CRATE-alpha
Users that are interested in CRATE-alpha are comparing it to the libraries listed below
- [CVPR 2024 Highlight] ImageNet-D ☆44 Updated last year
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis" ☆46 Updated last year
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆98 Updated last year
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling ☆39 Updated 8 months ago
- Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training" ☆182 Updated 4 months ago
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect… ☆61 Updated last year
- Dimple, the first Discrete Diffusion Multimodal Large Language Model ☆108 Updated 3 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆107 Updated last month
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders ☆36 Updated 4 months ago
- Official repository of paper "Subobject-level Image Tokenization" (ICML-25) ☆88 Updated 3 months ago
- Adapting LLaMA Decoder to Vision Transformer ☆30 Updated last year
- ☆112 Updated last year
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision