ForeverPs / IncrementalVHD_GPELinks
official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark
☆40Updated 2 years ago
Alternatives and similar repositories for IncrementalVHD_GPE
Users that are interested in IncrementalVHD_GPE are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of the “Spatial-Semantic Collaborative Cropping for User Generated Content”. (AAAI24)☆72Updated last year
- Adapted from https://note.com/kohya_ss/n/nbf7ce8d80f29 for easier cloning☆29Updated 2 years ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆86Updated last year
- ☆47Updated 9 months ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆105Updated 2 years ago
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023☆211Updated 2 years ago
- Code for Learning Subject-Aware Cropping by Outpainting Professional Photos☆26Updated 2 years ago
- [ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects☆65Updated 10 months ago
- [ICLR 2023] Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models☆50Updated 2 years ago
- ☆100Updated 2 years ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆146Updated last year
- Text-To-Image Generation with Chinese Characters☆23Updated 2 weeks ago
- ☆82Updated 2 years ago
- ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations☆33Updated 9 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆79Updated 10 months ago
- Text-To-Image Generation with Chinese Characters☆132Updated 2 years ago
- TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis☆88Updated 4 months ago
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆150Updated last year
- This repository is the official implementation of FLUX-CustomID. It is capable of generating images based on your face image at a level e…☆25Updated last year
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆52Updated 2 years ago
- The official implementation of our paper "Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption"☆38Updated 8 months ago
- PosterMaker [CVPR 2025] https://poster-maker.github.io/☆143Updated 2 months ago
- Chinese Stable Diffusion, zh SD,中文文生图,中文SD,中文Stable Diffusion☆49Updated last year
- Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes>☆180Updated 2 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆165Updated 7 months ago
- Chinese CLIP models with SOTA performance.☆60Updated 2 years ago
- [WWW 2025] Official PyTorch Code for "CTR-Driven Advertising Image Generation with Multimodal Large Language Models"☆61Updated 5 months ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Updated last year
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models☆72Updated last year
- ☆72Updated 2 years ago