lyk412 / Consistent123
[ACMMM 2024] Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors
☆23 · Updated 8 months ago
Alternatives and similar repositories for Consistent123
Users who are interested in Consistent123 are comparing it to the libraries listed below.
- Official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation ☆38 · Updated 3 months ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation ☆101 · Updated last month
- Implements VAR+CLIP for text-to-image (T2I) generation ☆141 · Updated 5 months ago
- A collection of vision foundation models unifying understanding and generation. ☆56 · Updated 6 months ago
- This is the official implementation for ControlVAR. ☆116 · Updated 7 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆126 · Updated last month
- CAR: Controllable AutoRegressive Modeling for Visual Generation ☆120 · Updated 7 months ago
- ICML2025 ☆49 · Updated last month
- [ECCV2024] The official implementation of the DiffPNG paper in PyTorch. ☆12 · Updated 9 months ago
- Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation ☆50 · Updated last month
- Code for the paper "ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions" published at CVPR 2025 ☆15 · Updated 4 months ago
- Frequency Autoregressive Image Generation with Continuous Tokens ☆79 · Updated last month
- High-performance Image Tokenizers for VAR and AR ☆275 · Updated 2 months ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆114 · Updated 8 months ago
- [ICCV2025] Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆141 · Updated last month
- Official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning" ☆32 · Updated 4 months ago
- ☆50 · Updated 7 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆351 · Updated last week
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆166 · Updated 3 months ago
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing ☆69 · Updated last week
- [ICLR2025] The code of Z-Sampling, proposed in our paper "Zigzag Diffusion Sampling: Diffusion Models Can Self-Improve via Self-Reflectio… ☆80 · Updated 4 months ago
- [CVPR 2025 (Oral)] Open implementation of "RandAR" ☆177 · Updated 3 months ago
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To… ☆124 · Updated 2 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆102 · Updated 3 months ago
- VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models ☆52 · Updated last month
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. ☆79 · Updated 2 months ago
- Official implementation of "STAR: Scale-wise Text-to-image generation via Auto-Regressive representations" ☆35 · Updated 4 months ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification ☆34 · Updated 3 months ago
- A collection of awesome autoregressive visual generation models ☆74 · Updated 2 months ago
- USP: Unified Self-Supervised Pretraining for Image Generation and Understanding ☆71 · Updated 2 weeks ago