linhaowei1 / CLoG
✌ CLoG: Benchmarking Continual Learning of Image Generation Models
☆14Updated 3 months ago
Related projects: ⓘ
- 🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant☆47Updated last week
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆34Updated 2 months ago
- Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆44Updated 3 weeks ago
- Official Repository of Multi-Object Hallucination in Vision-Language Models☆19Updated last month
- source code for NeurIPS'23 paper "Dream the Impossible: Outlier Imagination with Diffusion Models"☆59Updated 3 months ago
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?"☆32Updated 2 months ago
- LLMBind: A Unified Modality-Task Integration Framework☆14Updated 3 months ago
- Official code for ICLR 2024 paper Do Generated Data Always Help Contrastive Learning?☆25Updated 5 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆76Updated 8 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆26Updated 4 months ago
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆34Updated 3 months ago
- ☆72Updated 5 months ago
- [CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.☆20Updated 4 months ago
- 👀 Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023)☆82Updated 9 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆19Updated 3 weeks ago
- [ICLR 2024] Official pytorch implementation of "Denoising Task Routing for Diffusion Models"☆16Updated 7 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆26Updated 3 months ago
- VisualGPTScore for visio-linguistic reasoning☆26Updated 11 months ago
- Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning"☆73Updated 6 months ago
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆59Updated 6 months ago
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"☆26Updated 2 weeks ago
- ☕️ CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆24Updated 3 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation"☆37Updated this week
- Official code for paper: Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language☆20Updated 2 months ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆70Updated 2 weeks ago
- LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft☆37Updated 2 months ago
- ☆25Updated 2 months ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆23Updated 2 weeks ago
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Updated 9 months ago
- Awesome List of Consistency Models☆27Updated 2 weeks ago