och-mac / TraDiffusion
TraDiffusion: Trajectory-Based Training-Free Image Generation
☆51Updated 6 months ago
Alternatives and similar repositories for TraDiffusion
Users that are interested in TraDiffusion are comparing it to the libraries listed below
Sorting:
- The implementation of "An item is Worth a Prompt: Versatile Image Editing with Disentangled Control"☆73Updated 8 months ago
- ☆22Updated 5 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 9 months ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆55Updated 5 months ago
- ☆83Updated 8 months ago
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 6 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆69Updated 7 months ago
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆46Updated 8 months ago
- Code release for AccDiffusion (ECCV 2024)☆83Updated 5 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 3 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated 8 months ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Updated 6 months ago
- ☆24Updated last year
- Paper: "From Text to Pose to Image: Improving Diffusion Model Control and Quality"☆48Updated 5 months ago
- ☆70Updated 7 months ago
- ☆3Updated 7 months ago
- ☆34Updated last year
- [IJCAI 2025] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models"…☆85Updated last week
- The codes of Siggraph Asia 2024 paper "Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation"☆53Updated 3 weeks ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆54Updated 3 months ago
- Distilling Diversity and Control in Diffusion Models☆39Updated 2 weeks ago
- ☆22Updated 4 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 2 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆81Updated 9 months ago
- Official Implementation of weights2weights☆141Updated 2 months ago
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆115Updated 3 months ago
- DiT for VAE (and Video Generation)☆32Updated 8 months ago
- Fine-tune of Florence-2 for shot categorization.☆24Updated 2 months ago
- Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation☆69Updated last month
- Official Implementation of GrounDiT (NeurIPS 2024)☆53Updated 5 months ago