Code for the paper "AutoPresent: Designing Structured Visuals From Scratch" (CVPR 2025)
☆157May 26, 2025Updated 9 months ago
Alternatives and similar repositories for AutoPresent
Users that are interested in AutoPresent are comparing it to the libraries listed below
Sorting:
- The code in "SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design"☆42Oct 20, 2025Updated 4 months ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 5 months ago
- Recursive Visual Programming (ECCV 2024)☆18Nov 20, 2024Updated last year
- Collection of Aesthetics Assessment Papers for Graphic Designs.☆35Aug 29, 2025Updated 6 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆65Jan 1, 2026Updated 2 months ago
- ☆19Sep 19, 2024Updated last year
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…☆21Nov 2, 2023Updated 2 years ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated 11 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- Implementation of BitNet-1.58 instruct tuning☆27Apr 14, 2024Updated last year
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".☆27Oct 13, 2024Updated last year
- ☆31Jan 7, 2024Updated 2 years ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27May 16, 2025Updated 9 months ago
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆99Feb 2, 2026Updated last month
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…☆54Jan 22, 2026Updated last month
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Jan 16, 2023Updated 3 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 7 months ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 8 months ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆22Nov 1, 2025Updated 4 months ago
- [CVPR 2026] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence☆65Jul 9, 2025Updated 8 months ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆11Nov 30, 2025Updated 3 months ago
- [ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation☆11Aug 13, 2024Updated last year
- ☆15Jan 12, 2026Updated last month
- ☆15Mar 18, 2025Updated 11 months ago
- ☆13Aug 14, 2022Updated 3 years ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆28Oct 20, 2025Updated 4 months ago
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆22Nov 18, 2025Updated 3 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆50Sep 4, 2025Updated 6 months ago
- ☆37Jan 8, 2026Updated 2 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆66Jun 23, 2025Updated 8 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion☆47Feb 21, 2025Updated last year
- ☆13Jul 10, 2024Updated last year
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- ☆21Jun 26, 2025Updated 8 months ago
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆34Jun 13, 2025Updated 8 months ago