keshik6 / grafting
Exploring Diffusion Transformer Designs via Grafting
☆48Updated 2 months ago
Alternatives and similar repositories for grafting
Users that are interested in grafting are comparing it to the libraries listed below
- The official repo of continuous speculative decoding☆27Updated 5 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆86Updated last week
- ☆37Updated 3 months ago
- ☆69Updated 9 months ago
- ☆53Updated last month
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆39Updated 5 months ago
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.☆34Updated last year
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆45Updated 2 months ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆72Updated 5 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆38Updated 6 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆37Updated 2 months ago
- [Preprint] UCGM: Unified Continuous Generative Models☆166Updated 3 months ago
- [WIP🚧] 2025 up-to-date list of resources on visual tokenizers (primarily for visual generation). Give it a star 🌟 if you find it useful…☆16Updated 7 months ago
- This repository includes the official implementation of our paper "Scaling White-Box Transformers for Vision"☆48Updated last year
- Scaling RWKV-Like Architectures for Diffusion Models☆137Updated last year
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆178Updated last year
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆118Updated 3 months ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆124Updated 3 weeks ago
- Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.☆79Updated last month
- ☆34Updated 3 months ago
- Official Implementation of LaViDa: A Large Diffusion Language Model for Multimodal Understanding☆137Updated last month
- Official PyTorch implementation of the paper "Equivariant Image Modeling" (https://arxiv.org/abs/2503.18948)☆34Updated last month
- The official repo for "D-AR: Diffusion via Autoregressive Models"☆111Updated 2 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆30Updated 4 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆119Updated 5 months ago
- Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]☆83Updated last month
- Code for ICML 2025 Paper "Highly Compressed Tokenizer Can Generate Without Training"☆166Updated 2 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆47Updated last month
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆53Updated 9 months ago
- Implementation of a multimodal diffusion transformer in Pytorch☆103Updated last year