ShenZhang-Shin / LEDiTLinks
PyTorch Implementation of "LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding"
☆19Updated 4 months ago
Alternatives and similar repositories for LEDiT
Users that are interested in LEDiT are comparing it to the libraries listed below
Sorting:
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆12Updated 3 months ago
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆18Updated 4 months ago
- Official implementation of "Can Test-Time Scaling Improve World Foundation Model?"☆14Updated last week
- Official implementation of LaVin-DiT☆35Updated 5 months ago
- ☆23Updated last week
- Video Diffusion State Space Models☆19Updated last year
- Official implementation for "Diffusion Instruction Tuning"☆23Updated last month
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆33Updated 5 months ago
- Official Code for 'TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes' (CVPR 2024)☆11Updated last year
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆12Updated last year
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆24Updated 9 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆20Updated 4 months ago
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"☆19Updated last month
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated 8 months ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆20Updated last week
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆30Updated 2 months ago
- ☆37Updated last month
- The official code of "Image is All You Need to Empower Large-scale Diffusion Models for In-Domain Generation". [CVPR2025]☆20Updated 4 months ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Updated 2 weeks ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆37Updated 5 months ago
- [CVPR 2025 highlight] v-CLR: View-Consistent Learning for Open-World Instance Segmentation☆19Updated 3 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Updated last year
- ☆15Updated last year
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆25Updated 8 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better☆32Updated last month
- Optimized -KAN-based Transformer Models Performance Test on Various Tasks (Point cloud&Vision)☆10Updated last year
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆34Updated 2 weeks ago
- ☆15Updated 3 months ago
- Official repository for TikTok-DeepFake (TT-DF)☆12Updated 5 months ago
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆106Updated 3 weeks ago