lyk412 / Consistent123
[ACMMM 2024] Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors
☆16Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Consistent123
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens☆53Updated 3 weeks ago
- Implements VAR+CLIP for image generation☆78Updated 3 months ago
- The official implementation of Hierarchical Semantic Decoding with Counting Assitance for Generalized Referring Expression Segmentation☆16Updated 5 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆40Updated 4 months ago
- [TPAMI2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆17Updated last month
- Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language☆21Updated 4 months ago
- Diffusion Feedback Helps CLIP See Better☆214Updated 2 months ago
- This is the official implementation for ControlVAR.☆52Updated last month
- ☆31Updated last month
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆59Updated last month
- ☆26Updated 3 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆90Updated 7 months ago
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆10Updated 3 weeks ago
- ☆49Updated 2 weeks ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆46Updated 2 months ago
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】☆58Updated 3 weeks ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆25Updated 2 months ago
- ☆110Updated 4 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆48Updated last month
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆12Updated 3 months ago
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)☆88Updated 5 months ago
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models☆206Updated 3 weeks ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 3 months ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆64Updated 5 months ago
- Official PyTorch implementation of the paper "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner"☆15Updated last year
- Unified Multi-modal IAA Baseline and Benchmark☆70Updated last month
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models☆89Updated 2 months ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆109Updated 2 months ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆78Updated 8 months ago