stepfun-ai / Step1X-EditLinks
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.
β1,426Updated this week
Alternatives and similar repositories for Step1X-Edit
Users that are interested in Step1X-Edit are comparing it to the libraries listed below
Sorting:
- π₯π₯ UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioningβ1,118Updated 2 months ago
- A minimal and universal controller for FLUX.1.β1,639Updated 2 weeks ago
- Official repository of In-Context LoRA for Diffusion Transformersβ1,913Updated 6 months ago
- Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistenβ¦β1,718Updated last month
- β990Updated last month
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignmentβ1,196Updated 2 weeks ago
- Illumination Drawing Tools for Text-to-Image Diffusion Modelsβ768Updated last month
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideoβ1,507Updated last month
- πΉ A more flexible framework that can generate videos at any resolution and creates videos from images.β1,106Updated this week
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Modelβ875Updated last week
- πΊ An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusionβ2,170Updated 3 months ago
- β720Updated 6 months ago
- β1,177Updated 2 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generationβ1,072Updated last week
- β748Updated 4 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRAβ1,577Updated 8 months ago
- StoryMaker: Towards consistent characters in text-to-image generationβ700Updated 6 months ago
- [ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Videoβ1,260Updated 3 weeks ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformersβ528Updated 2 weeks ago
- A pipeline parallel training script for diffusion models.β1,154Updated this week
- Lumina-Image 2.0: A Unified and Efficient Image Generative Frameworkβ726Updated 3 weeks ago
- β521Updated 5 months ago
- FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesisβ1,356Updated last month
- CogView4, CogView3-Plus and CogView3(ECCV 2024)β1,063Updated 2 months ago
- [SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"β398Updated 2 months ago
- SkyReels-A2: Compose anything in video diffusion transformersβ612Updated 2 weeks ago
- β548Updated this week
- β439Updated 2 weeks ago
- [TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"β565Updated 5 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.β748Updated 6 months ago