mengcye / LAION-SG
☆47Updated 3 weeks ago
Alternatives and similar repositories for LAION-SG:
Users that are interested in LAION-SG are comparing it to the libraries listed below
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆41Updated 5 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆103Updated 7 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆26Updated 8 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆44Updated 2 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆61Updated 8 months ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆29Updated last month
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆77Updated this week
- The official implementation of PAR: Parallelized Autoregressive Visual Generation. https://epiphqny.github.io/PAR-project/☆103Updated last week
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 5 months ago
- ☆33Updated 3 weeks ago
- ☆40Updated 5 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆35Updated last week
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆66Updated 5 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆45Updated 2 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆65Updated last month
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆37Updated last year
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆20Updated 3 weeks ago
- Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing☆50Updated 9 months ago
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆62Updated 7 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆27Updated 4 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆77Updated 9 months ago
- The official repo of continuous speculative decoding☆19Updated last month
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆79Updated 8 months ago
- Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆52Updated 3 months ago
- ☆20Updated 2 weeks ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆88Updated 9 months ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆35Updated 8 months ago
- LaVin-DiT☆16Updated last month
- ☆19Updated last year
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆39Updated last month