aim-uofa / VLModel
Repo of HawkLlama.
☆10Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for VLModel
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆50Updated 5 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆48Updated this week
- ☆38Updated 11 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆46Updated 2 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 3 months ago
- ☆29Updated last week
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆82Updated 2 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆88Updated last month
- ☆91Updated 5 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆34Updated 3 weeks ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆90Updated 7 months ago
- This is the official implementation for DragVideo☆42Updated last month
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models☆61Updated last month
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆77Updated 7 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆108Updated last month
- Official implementation of Aurora☆81Updated last year
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper☆126Updated 6 months ago
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens☆53Updated 3 weeks ago
- Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors☆82Updated last week
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆48Updated last week
- a collection of awesome autoregressive visual generation models☆37Updated this week
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆84Updated 7 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆76Updated 3 weeks ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆40Updated 3 weeks ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆87Updated 3 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆40Updated 4 months ago
- ☆20Updated last week
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆64Updated 8 months ago
- This is the official implementation for ControlVAR.☆52Updated 3 weeks ago
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆16Updated 5 months ago