Whalesong-zrs / Towards-Fine-grained-HBOE
The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version).
☆16Updated 10 months ago
Alternatives and similar repositories for Towards-Fine-grained-HBOE:
Users that are interested in Towards-Fine-grained-HBOE are comparing it to the libraries listed below
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆20Updated 2 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆65Updated this week
- A collection of vision foundation models unifying understanding and generation.☆42Updated 2 months ago
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆31Updated 2 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆107Updated 3 months ago
- ☆21Updated 9 months ago
- This is the official implementation for ControlVAR.☆95Updated 2 months ago
- Implements VAR+CLIP for text-to-image (T2I) generation☆124Updated last month
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆37Updated 10 months ago
- My implement of InstantBooth☆9Updated last year
- [CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition☆139Updated last month
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆273Updated this week
- The official implementation of work "REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment".☆105Updated 5 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆36Updated last week
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆84Updated 6 months ago
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models☆199Updated last month
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting .etc.☆22Updated this week
- Liquid: Language Models are Scalable and Unified Multi-modal Generators☆67Updated this week
- [ICLR2025]☆138Updated last month
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation☆23Updated 11 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆94Updated 11 months ago
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆87Updated 4 months ago
- ☆11Updated 2 months ago
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan…☆32Updated last month
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆79Updated 3 weeks ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆61Updated this week
- ☆20Updated last year
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models☆68Updated 5 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆117Updated last month
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆52Updated 2 weeks ago