ForeverPs / IncrementalVHD_GPE
official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark
☆34Updated last year
Alternatives and similar repositories for IncrementalVHD_GPE:
Users that are interested in IncrementalVHD_GPE are comparing it to the libraries listed below
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models☆66Updated 11 months ago
- This repository is the official implementation of FLUX-CustomID. It is capable of generating images based on your face image at a level e…☆21Updated 4 months ago
- [WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"☆24Updated 2 weeks ago
- Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning☆28Updated last year
- JoyType: A Robust Design for Multilingual Visual Text Creation☆33Updated 4 months ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆11Updated 9 months ago
- ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations☆16Updated 3 weeks ago
- ☆12Updated 2 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆69Updated 8 months ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆49Updated last year
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation☆29Updated last year
- Code for Learning Subject-Aware Cropping by Outpainting Professional Photos☆16Updated last year
- [ICLR 2023] Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models☆50Updated last year
- Official PyTorch implementation of the “Spatial-Semantic Collaborative Cropping for User Generated Content”. (AAAI24)☆57Updated last year
- ☆37Updated 2 months ago
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆52Updated 11 months ago
- Chinese CLIP models with SOTA performance.☆54Updated last year
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆43Updated last month
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆33Updated last month
- [CVPR2024] Official implementation of High-fidelity Person-centric Subject-to-Image Synthesis.☆51Updated last month
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆46Updated 4 months ago
- ☆142Updated 9 months ago
- ☆58Updated 7 months ago
- ☆27Updated 5 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆57Updated last week
- Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes>☆66Updated 3 weeks ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆133Updated 2 months ago
- ☆17Updated last month
- ☆21Updated last year
- Text-To-Image Generation with Chinese Characters☆21Updated last year