ForeverPs / IncrementalVHD_GPELinks
official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark
☆39Updated last year
Alternatives and similar repositories for IncrementalVHD_GPE
Users that are interested in IncrementalVHD_GPE are comparing it to the libraries listed below
Sorting:
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆84Updated last year
- Official PyTorch implementation of the “Spatial-Semantic Collaborative Cropping for User Generated Content”. (AAAI24)☆70Updated last year
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆102Updated 2 years ago
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆51Updated last year
- ☆46Updated 6 months ago
- Code for Learning Subject-Aware Cropping by Outpainting Professional Photos☆22Updated last year
- ☆72Updated 2 years ago
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"☆38Updated 5 months ago
- ☆99Updated last year
- ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations☆32Updated 6 months ago
- Chinese CLIP models with SOTA performance.☆59Updated 2 years ago
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models☆69Updated last year
- Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning☆29Updated 2 years ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆140Updated 9 months ago
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023☆197Updated 2 years ago
- [WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"☆56Updated 2 months ago
- ☆95Updated last month
- Text-To-Image Generation with Chinese Characters☆130Updated 2 years ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Updated last year
- Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes>☆161Updated 7 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆77Updated 7 months ago
- Text-To-Image Generation with Chinese Characters☆22Updated 2 years ago
- Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models☆47Updated 3 weeks ago
- This repository is the official implementation of FLUX-CustomID. It is capable of generating images based on your face image at a level e…☆24Updated 11 months ago
- TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis☆83Updated last month
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆151Updated 11 months ago
- [ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects☆65Updated 7 months ago
- PosterMaker [CVPR 2025] https://poster-maker.github.io/☆131Updated 6 months ago
- [ICLR 2023] Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models☆50Updated last year
- ☆183Updated 2 months ago