Whalesong-zrs / Towards-Fine-grained-HBOE
The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version).
☆16 Updated 7 months ago
Related projects
Alternatives and complementary repositories for Towards-Fine-grained-HBOE
- Official implementation for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way ☆18 Updated last month
- Empowering Unified MLLM with Multi-granular Visual Generation ☆106 Updated last month
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆47 Updated 2 months ago
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing" ☆82 Updated 2 months ago
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan… ☆21 Updated 3 weeks ago
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition ☆110 Updated 2 months ago
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training ☆35 Updated 7 months ago
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning ☆36 Updated 4 months ago
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation ☆71 Updated 3 weeks ago
- Accepted by CVPR 2024 ☆28 Updated 6 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator ☆91 Updated 8 months ago
- Implements VAR+CLIP for image generation ☆78 Updated 3 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆50 Updated this week
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation ☆23 Updated 7 months ago
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens ☆55 Updated this week
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples ☆28 Updated 3 weeks ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model". ☆100 Updated 4 months ago
- Semantic Score Distillation Sampling for Compositional Text-to-3D Generation ☆28 Updated last month
- Fine-tune VAE of Stable Diffusion model ☆16 Updated last month
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆75 Updated 2 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models ☆141 Updated last month
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆76 Updated last month
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆42 Updated 4 months ago
- This is the official implementation for ControlVAR. ☆52 Updated last month
- Training-Free Condition-Guided Text-to-Video Generation ☆57 Updated 10 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation ☆25 Updated 2 months ago