para-lost/AutoPresent

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/para-lost/AutoPresent)

para-lost / AutoPresent

Code for the paper "AutoPresent: Designing Structured Visuals From Scratch" (CVPR 2025)

☆173

Alternatives and similar repositories for AutoPresent

Users that are interested in AutoPresent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vinsontang1 / SlideCoder
View on GitHub
The code in "SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design"
☆47Oct 20, 2025Updated 8 months ago
sahilg06 / Awesome-Aesthetics-Assessment
View on GitHub
Collection of Aesthetics Assessment Papers for Graphic Designs.
☆44Mar 25, 2026Updated 3 months ago
tianyi-lab / DisCL
View on GitHub
[ICCV 2025] Diffusion Curriculum (DisCL)
☆18Sep 26, 2025Updated 9 months ago
HumanMLLM / LOVE-R1
View on GitHub
Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"
☆24Nov 1, 2025Updated 8 months ago
showlab / TPDiff
View on GitHub
TPDiff: Temporal Pyramid Video Diffusion Model
☆25Mar 13, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
hany01rye / tiger
View on GitHub
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
☆22Nov 18, 2025Updated 7 months ago
Chenyu-Wang567 / All-Angles-Bench
View on GitHub
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
☆68Mar 22, 2026Updated 3 months ago
wangqiang9 / Awesome-RLHF-Video-Diffusion
View on GitHub
RLHF for Video Diffusion Models
☆26Jul 30, 2025Updated 11 months ago
WangWenhao0716 / PDF-Embedding
View on GitHub
[NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"
☆18Oct 1, 2024Updated last year
tsunghan-wu / reverse_vlm
View on GitHub
🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…
☆58Jan 22, 2026Updated 5 months ago
CyberAgentAILab / sprite-decompose
View on GitHub
[ECCV2024] Fast Sprite Decomposition from Animated Graphics
☆31Sep 26, 2024Updated last year
Arking1995 / COHO
View on GitHub
[ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation
☆13Aug 13, 2024Updated last year
TAU-VAILab / isbertblind
View on GitHub
This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…
☆21Nov 2, 2023Updated 2 years ago
kingnobro / Chat2SVG
View on GitHub
(CVPR 2025) Code of "Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models"
☆238Apr 2, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
model-similarity / lm-similarity
View on GitHub
☆21Feb 10, 2025Updated last year
ziqipang / ADDP
View on GitHub
[ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
☆15Jul 4, 2025Updated last year
mayu-ot / ltsim
View on GitHub
☆17Mar 24, 2025Updated last year
JunjieYang97 / Meta-ControlNet
View on GitHub
☆31Jan 7, 2024Updated 2 years ago
FreedomIntelligence / TRIM
View on GitHub
We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…
☆22Jan 11, 2026Updated 6 months ago
kongdai123 / consistency2
View on GitHub
☆16Jun 14, 2024Updated 2 years ago
haoningwu3639 / SpatialScore
View on GitHub
[CVPR 2026 Highlight] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence
☆84May 28, 2026Updated last month
lmarena / search-arena
View on GitHub
⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".
☆57Feb 23, 2026Updated 4 months ago
mu-cai / TemporalBench
View on GitHub
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
☆40Nov 10, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
qq456cvb / 3DCorrEnhance
View on GitHub
☆37Jun 13, 2026Updated 3 weeks ago
likaixin2000 / MMCode
View on GitHub
[EMNLP 2024] Multi-modal reasoning problems via code generation.
☆28Apr 14, 2026Updated 2 months ago
Alibaba-NLP / E2Rank
View on GitHub
E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker
☆57Jul 1, 2026Updated last week
paintscene4d / paintscene4d.github.io
View on GitHub
☆25Mar 30, 2025Updated last year
akhilkedia / TranformersGetStable
View on GitHub
[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"
☆10Jul 19, 2024Updated last year
visual-haystacks / mirage
View on GitHub
🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
☆27Feb 9, 2025Updated last year
see-say-segment / sesame
View on GitHub
🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"
☆47Jun 16, 2024Updated 2 years ago
TIGER-AI-Lab / VideoEval-Pro
View on GitHub
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15Jun 1, 2026Updated last month
Qichuzyy / POA
View on GitHub
Official implementation of ECCV24 paper: POA
☆24Aug 8, 2024Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
stellalisy / PrefPalette
View on GitHub
☆21Apr 3, 2026Updated 3 months ago
xjywhu / Awesome-Multimodal-LLM-for-Code
View on GitHub
Multimodal Large Language Models for Code Generation under Multimodal Scenarios
☆270Jun 25, 2026Updated 2 weeks ago
pgasawa / BARE
View on GitHub
Leveraging Base Language Models for Few-Shot Synthetic Data Generation
☆41Oct 18, 2025Updated 8 months ago
xiaohangt / wd1
View on GitHub
Official Implementation of wd1
☆32Sep 25, 2025Updated 9 months ago
GLUS-video / GLUS
View on GitHub
[CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…
☆70Jun 23, 2025Updated last year
crcrpar / instance_normalization_chainer
View on GitHub
chainer v2 implementation of instance normalization
☆11Aug 8, 2018Updated 7 years ago
jama1017 / MoVer
View on GitHub
Official implementation of the paper MoVer: Motion Verification for Motion Graphics Animations (SIGGRAPH 2025)
☆37May 3, 2026Updated 2 months ago