ShenZhang-Shin / LEDiT
PyTorch Implementation of "LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding"
☆16Updated 3 weeks ago
Alternatives and similar repositories for LEDiT:
Users that are interested in LEDiT are comparing it to the libraries listed below
- ☆16Updated this week
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆26Updated last month
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆25Updated this week
- Official implementation of LaVin-DiT☆27Updated 2 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- The official pytorch implementation of “Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization”.☆12Updated 2 weeks ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆20Updated 5 months ago
- ☆38Updated last year
- Video Diffusion State Space Models☆19Updated last year
- [CVPR 2025] Test-Time Visual In-Context Tuning☆16Updated this week
- Official Code for 'TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes' (CVPR 2024)☆11Updated 9 months ago
- [CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories☆26Updated 3 weeks ago
- The codes of our paper "ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion"☆10Updated 2 months ago
- ☆10Updated 8 months ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆18Updated 3 months ago
- [CVPR 2025] The official implementation of "CacheQuant: Comprehensively Accelerated Diffusion Models"☆19Updated 3 weeks ago
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆17Updated 2 weeks ago
- Sora Generates Videos with Stunning Geometrical Consistency☆49Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆19Updated 3 weeks ago
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object via Next-View Prediction'☆21Updated 2 weeks ago
- ☆16Updated last year
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆42Updated last week
- Autoregressive Image Generation with Randomized Parallel Decoding☆35Updated last week
- Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆33Updated 3 weeks ago
- The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025]☆18Updated 2 weeks ago
- OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation☆13Updated 3 months ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆31Updated 4 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆32Updated 2 weeks ago
- ☆14Updated 3 weeks ago
- 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors☆10Updated 6 months ago