yael-vinker / SketchAgentLinks
β154Updated 3 weeks ago
Alternatives and similar repositories for SketchAgent
Users that are interested in SketchAgent are comparing it to the libraries listed below
Sorting:
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"β134Updated 8 months ago
- π₯ Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"β111Updated last week
- Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexibleβ69Updated last week
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.β109Updated 7 months ago
- [CVPR2025] Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Dataβ62Updated 2 months ago
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesisβ104Updated last year
- β25Updated 6 months ago
- Benchmarking physical understanding in generative video modelsβ176Updated last month
- CVPR 2025 Workshop on CVEU.β40Updated 2 weeks ago
- Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesisβ2Updated 8 months ago
- [IJCAI 2025] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models"β¦β88Updated last month
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharingβ58Updated 6 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesisβ56Updated 2 months ago
- [ARXIV'25] GameFactory: Creating New Games with Generative Interactive Videosβ309Updated 3 months ago
- Official PyTorch implementation of TokenSet.β121Updated 3 months ago
- ICML 2025 - Impossible Videosβ68Updated 3 weeks ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"β133Updated 8 months ago
- Anim-Director: Controllable Animation Video Generation with Large Models-based Multimodal Agentsβ80Updated 2 weeks ago
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generationβ190Updated 4 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!β113Updated 3 months ago
- [NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidanceβ127Updated 8 months ago
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learningβ251Updated 2 months ago
- β66Updated last year
- β97Updated 2 months ago
- EditWorld: Simulating World Dynamics for Instruction-Following Image Editingβ131Updated last year
- Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Generβ¦β259Updated this week
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"β253Updated last month
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-projectβ164Updated 3 months ago
- β131Updated 3 months ago
- DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generationβ160Updated 2 weeks ago