keshik6 / graftingLinks
Exploring Diffusion Transformer Designs via Grafting
☆45Updated 3 weeks ago
Alternatives and similar repositories for grafting
Users that are interested in grafting are comparing it to the libraries listed below
Sorting:
- The official repo of continuous speculative decoding☆27Updated 3 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year
- Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]☆73Updated 2 weeks ago
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆45Updated last week
- ☆70Updated 7 months ago
- ☆37Updated last month
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆27Updated 2 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated 8 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 4 months ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆52Updated 7 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆37Updated 4 months ago
- RS-IMLE☆41Updated 7 months ago
- Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆73Updated 3 weeks ago
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆105Updated 3 weeks ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching☆47Updated 2 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆46Updated 4 months ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆115Updated last week
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆24Updated 2 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆76Updated 7 months ago
- ☆17Updated 6 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆37Updated 5 months ago
- Official Implementation of LaViDa: :A Large Diffusion Language Model for Multimodal Understanding☆115Updated 3 weeks ago
- ☆49Updated last week
- Official implementation for "Diffusion Instruction Tuning"☆23Updated last month
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆44Updated 3 months ago
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆33Updated 4 months ago
- ☆64Updated 3 weeks ago
- Train vector quantized CLIP models using pytorch lightning☆20Updated last year
- Autoregressive Image Generation with Randomized Parallel Decoding☆68Updated 3 months ago