ForeverPs / IncrementalVHD_GPE
official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark
☆26Updated 8 months ago
Related projects: ⓘ
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆39Updated 2 months ago
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models☆58Updated 5 months ago
- ☆92Updated 2 months ago
- ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆32Updated 2 months ago
- Precision Search through Multi-Style Inputs☆45Updated last month
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆29Updated last month
- Implementation code:Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models☆50Updated 8 months ago
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆46Updated 5 months ago
- [ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion☆34Updated 2 months ago
- Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models☆18Updated last month
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation☆23Updated 5 months ago
- VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆93Updated last month
- ☆78Updated 8 months ago
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆32Updated 6 months ago
- Offical code repository of “BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training”☆54Updated last month
- ☆140Updated 2 months ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆36Updated 8 months ago
- Unified Multi-modal IAA Baseline and Benchmark☆68Updated 5 months ago
- The HD-VG-130M Dataset☆106Updated 5 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆75Updated 2 months ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆111Updated 4 months ago
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023☆98Updated last year
- Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance☆164Updated last month
- Text-To-Image Generation with Chinese Characters☆17Updated last year
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆107Updated 3 months ago
- [ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions☆112Updated 2 months ago
- ☆72Updated 8 months ago
- ☆34Updated 5 months ago
- [SIGGRAPH Asia 2024] Painting process generating using diffusion models☆46Updated last month
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. 一个支持用户自由输入控…☆104Updated 2 months ago