UCSC-VLAA / CRATE-alpha
This repository includes the official implementation of our paper "Scaling White-Box Transformers for Vision"
☆45 Updated 5 months ago
Related projects
Alternatives and complementary repositories for CRATE-alpha
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis" ☆34 Updated 5 months ago
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation ☆31 Updated 2 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆42 Updated 4 months ago
- Official code for ICLR 2024 paper Do Generated Data Always Help Contrastive Learning? ☆28 Updated 7 months ago
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆98 Updated 6 months ago
- This is a repo to track the latest autoregressive visual generation papers. ☆50 Updated this week
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129) ☆91 Updated last year
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks". ☆72 Updated 2 months ago
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆64 Updated 5 months ago
- ☆48 Updated 5 months ago
- Official PyTorch implementation for "Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels" ☆80 Updated 10 months ago
- [NeurIPS-2024] The official Implementation of "Instruction-Guided Visual Masking" ☆29 Updated last week
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens ☆57 Updated last week
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings" ☆110 Updated 3 months ago
- Official pytorch repository for “Guidance with Spherical Gaussian Constraint for Conditional Diffusion” ☆43 Updated 4 months ago
- Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries". ☆47 Updated 6 months ago
- ☆101 Updated 8 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models ☆40 Updated 4 months ago
- ☆33 Updated 4 months ago
- [NeurIPS 2024] Efficient Multi-modal Models via Stage-wise Visual Context Compression ☆41 Updated 3 months ago
- ☆105 Updated 3 months ago
- Open source implementation of "Vision Transformers Need Registers" ☆143 Updated last week
- Learning 1D Causal Visual Representation with De-focus Attention Networks ☆30 Updated 5 months ago
- ☆109 Updated 5 months ago
- [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment ☆51 Updated last month
- Official repository of paper "Subobject-level Image Tokenization" ☆62 Updated 6 months ago
- ☆52 Updated last year
- Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning" ☆77 Updated 8 months ago
- Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling" ☆64 Updated last year