XuweiyiChen / Pix2GifLinks
☆11Updated last year
Alternatives and similar repositories for Pix2Gif
Users that are interested in Pix2Gif are comparing it to the libraries listed below
Sorting:
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆120Updated last year
- A demo of fine tune Stable Diffusion on Pokemon-Blip-Captions in English, Japanese and Chinese Corpus☆37Updated 2 years ago
- 扩散模型算法基础文档、训练、实验、部署等仓库☆39Updated 4 months ago
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up…☆24Updated 7 months ago
- ☆82Updated last year
- LMM solved catastrophic forgetting, AAAI2025☆44Updated 2 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆78Updated last year
- ☆19Updated 3 months ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated last year
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆140Updated 8 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆72Updated 11 months ago
- Gradio demo used in our Osprey:Pixel Understanding with Visual Instruction Tuning.☆15Updated last year
- ☆9Updated 10 months ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".☆144Updated last year
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆125Updated 8 months ago
- Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4☆27Updated 2 years ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆57Updated 10 months ago
- [ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models☆75Updated 8 months ago
- ☆80Updated last year
- Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)☆164Updated 11 months ago
- ☆105Updated last year
- DiffStyle: Leverage Diffusion Prior to One-for-All Style Transfer. Course project of CS3310 Computer Graphics, built on Prompt-to-Prompt …☆89Updated 2 years ago
- This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".☆84Updated 5 months ago
- ☆84Updated 4 months ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆60Updated 9 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Updated last year
- ☆16Updated 11 months ago
- [AAAI 2025] LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation☆43Updated 6 months ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆135Updated 5 months ago
- ☆112Updated 2 years ago