yael-vinker / SketchAgentLinks
☆179Updated 4 months ago
Alternatives and similar repositories for SketchAgent
Users that are interested in SketchAgent are comparing it to the libraries listed below
Sorting:
- [ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible☆104Updated 2 months ago
- Controllable Animation Video Generation with Large Models-based Multimodal Agents☆205Updated 3 weeks ago
- Official PyTorch implementation of TokenSet.☆125Updated 7 months ago
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos☆424Updated 7 months ago
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.☆114Updated 11 months ago
- NEO Series: Native Vision-Language Models from First Principles☆180Updated this week
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"☆153Updated last year
- ☆319Updated 2 months ago
- [WACV 2025] Official implementation for the paper "Diffusion-based Visual Anagram as Multi-task Learning"☆57Updated 5 months ago
- [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation☆295Updated 3 months ago
- ☆166Updated this week
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆200Updated 8 months ago
- ICML 2025 - Impossible Videos☆77Updated 3 months ago
- Benchmarking physical understanding in generative video models☆207Updated 3 weeks ago
- Inference-time scaling of diffusion-based image and video generation models.☆169Updated 4 months ago
- CVPR 2025 Workshop on CVEU.☆42Updated 4 months ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆255Updated 7 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025☆101Updated last month
- Nano-consistent-150k☆213Updated last week
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆120Updated 7 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆175Updated this week
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆61Updated 5 months ago
- Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translati…☆259Updated 5 months ago
- ☆25Updated 10 months ago
- Official Implementation of weights2weights☆147Updated 7 months ago
- Simulating the Real World: Survey & Resources, which contains our survey "Simulating the Real World: A Unified Survey of Multimodal Gener…☆301Updated this week
- Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.☆92Updated 2 weeks ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆300Updated 7 months ago
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"☆157Updated 3 months ago