thu-ml / RIFLExLinks

Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)

☆736

Alternatives and similar repositories for RIFLEx

Users that are interested in RIFLEx are comparing it to the libraries listed below

Sorting:

Alpha-VLLM / Lumina-Video
☆412Updated 8 months ago
alibaba / Tora
[CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation
☆1,208Updated 4 months ago
360CVGroup / FancyVideo
Video generation from text&image, 1st-gen
☆921Updated 6 months ago
Vchitect / Vchitect-2.0
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
☆916Updated 7 months ago
SkyworkAI / UniPic
Unified Multimodal Model for image generation/editing/understanding
☆802Updated 2 months ago
NJU-PCALab / RAG-Diffusion
[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
☆614Updated 4 months ago
Alpha-VLLM / Lumina-mGPT-2.0
Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
☆1,070Updated last week
damo-cv / RealisDance
The official implementation of RealisDance
☆605Updated 4 months ago
Doby-Xu / WithAnyone
✨ WithAnyone is capable of generating high-quality, controllable, and ID consistent images
☆467Updated last week
XueZeyue / DanceGRPO
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
☆1,181Updated 3 weeks ago
rhymes-ai / Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…
☆1,105Updated 9 months ago
Tencent-Hunyuan / MixGRPO
MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE
☆1,033Updated last month
PKU-YuanGroup / ConsisID
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
☆782Updated 2 months ago
Alpha-VLLM / Lumina-DiMOO
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
☆872Updated last week
ali-vilab / UniAnimate-DiT
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
☆808Updated 6 months ago
bytedance / ComfyUI_InfiniteYou
🔥 [ICCV 2025 Highlight] Official ComfyUI native node supporting InfiniteYou with FLUX
☆281Updated 3 months ago
bytedance / XVerse
[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulatio…
☆611Updated 3 weeks ago
KwaiVGI / 3DTrajMaster
[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation
☆359Updated 4 months ago
sy77777en / CameraBench
[NeurIPS 2025 Spotlight] Towards Understanding Camera Motions in Any Video
☆244Updated 3 weeks ago
fudan-generative-vision / OpenHumanVid
[CVPR 2025] A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
☆291Updated 8 months ago
zibojia / COCOCO
Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, C…
☆312Updated last year
megvii-research / megactor
☆901Updated 11 months ago
IDKiro / sdxs
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions
☆641Updated last year
thu-ml / cond-image-leakage
Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024)
☆256Updated 6 months ago
yrcong / flatten
Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024)
☆208Updated last year
JavisVerse / JavisDiT
Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"
☆285Updated last month
360CVGroup / Qihoo-T2X
Efficient DiT architecture for text2any tasks, ICLR2025
☆450Updated 6 months ago
aigc3d / AniGS
[CVPR2025] AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction
☆449Updated 8 months ago
showlab / Show-1
[IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
☆1,132Updated 2 months ago
ali-vilab / UniAnimate
Code for SCIS-2025 Paper "UniAnimate: Taming Unified Video Diﬀusion Models for Consistent Human Image Animation".
☆1,172Updated 7 months ago