ShenZhang-Shin / LEDiT
PyTorch Implementation of "LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding"
☆19Updated last month
Alternatives and similar repositories for LEDiT:
Users that are interested in LEDiT are comparing it to the libraries listed below
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆9Updated 3 weeks ago
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆13Updated last month
- Video Diffusion State Space Models☆19Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Updated 6 months ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆31Updated 5 months ago
- ☆39Updated last year
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations"☆30Updated last month
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆34Updated 10 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".☆38Updated 3 months ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion☆49Updated last year
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆31Updated 2 months ago
- Sora Generates Videos with Stunning Geometrical Consistency☆49Updated last year
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆22Updated 6 months ago
- The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025]☆19Updated last month
- ☆16Updated last year
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆12Updated last year
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆40Updated 2 weeks ago
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆20Updated 4 months ago
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆14Updated last month
- Official implementation for "Diffusion Instruction Tuning"☆22Updated 2 months ago
- Official implementation of LaVin-DiT☆31Updated 3 months ago
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆37Updated 7 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆18Updated 5 months ago
- OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation☆13Updated 4 months ago
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]☆14Updated 9 months ago
- ☆19Updated last week
- Autoregressive Image Generation with Randomized Parallel Decoding☆50Updated 3 weeks ago
- ☆23Updated last month
- ☆14Updated last week
- TPDiff: Temporal Pyramid Video Diffusion Model☆19Updated last month