yael-vinker / SketchAgent

☆168

Alternatives and similar repositories for SketchAgent

Users who are interested in SketchAgent are comparing it to the repositories listed below.

Vchitect / Evaluation-Agent
[ACL2025 Oral] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible
☆87 Updated last month
HITsz-TMG / Anim-Director
Anim-Director: Controllable Animation Video Generation with Large Models-based Multimodal Agents
☆84 Updated last month
Gengzigang / TokenSet
Official PyTorch implementation of TokenSet.
☆121 Updated 4 months ago
aminK8 / KnobGen
CVPR 2025 Workshop on CVEU.
☆41 Updated last month
g-luo / dual_process
Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025
☆85 Updated last month
Pixtella / Anagram-MTL
[WACV 2025] Official implementation for the paper "Diffusion-based Visual Anagram as Multi-task Learning"
☆57 Updated 2 months ago
KwaiVGI / GameFactory
[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
☆345 Updated 4 months ago
Kmcode1 / SG-I2V
This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.
☆110 Updated 8 months ago
Jialuo-Li / Science-T2I
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
☆59 Updated 3 months ago
zhenyuw16 / GenArtist
Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"
☆143 Updated 9 months ago
sayakpaul / tt-scale-flux
Inference-time scaling of diffusion-based image and video generation models.
☆161 Updated last month
ALEEEHU / World-Simulator
Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Gener…
☆281 Updated this week
YangLing0818 / Trans4D
Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis
☆2 Updated 10 months ago
showlab / Impossible-Videos
ICML 2025 - Impossible Videos
☆72 Updated 2 weeks ago
invictus717 / InteractiveVideo
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
☆128 Updated last year
csuhan / Tar
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
☆131 Updated last month
stdstu12 / YUME
☆244 Updated 2 weeks ago
theEricMa / TriplaneTurbo
[CVPR2025] Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
☆62 Updated 3 months ago
YangLing0818 / IterComp
[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
☆193 Updated 5 months ago
tomtom1103 / compose-and-conquer
[ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
☆104 Updated last year
shiml20 / FlowTurbo
[NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"
☆71 Updated 10 months ago
wenhao728 / awesome-diffusion-v2v
Awesome diffusion Video-to-Video (V2V). A collection of papers on diffusion model-based video editing, aka video-to-video (V2V) translati…
☆242 Updated 2 months ago
Correr-Zhou / MagicTailor
[IJCAI 2025] Official implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models"…
☆87 Updated 3 months ago
snap-research / weights2weights
Official Implementation of weights2weights
☆147 Updated 5 months ago
ByteVisionLab / DetailFlow
🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"
☆148 Updated last month
GameGen-X / GameGen-X
☆302 Updated 2 months ago
jianzongwu / MotionBooth
[NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
☆136 Updated 10 months ago
briannlongzhao / DreamDistribution
☆96 Updated 3 months ago
alexanderswerdlow / unidisc
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…
☆112 Updated 4 months ago
EPFL-VILAB / ViPer
☆70 Updated 10 months ago