IDEA-Research / hana
Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in PyTorch
☆17 Updated last year
Related projects
Alternatives and complementary repositories for hana
- ☆44 Updated last year
- [ICCV 2023] Controllable Person Image Synthesis with Pose-Constrained Latent Diffusion ☆38 Updated last year
- ☆10 Updated 10 months ago
- Code for FineRewards ☆19 Updated last year
- Teach-DETR: Better Training DETR with Teachers ☆29 Updated 8 months ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion ☆46 Updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆84 Updated 4 months ago
- ☆19 Updated last year
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi… ☆34 Updated 2 weeks ago
- ☆21 Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones". ☆23 Updated 9 months ago
- An official pytorch implementation of "MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts" ☆19 Updated 3 weeks ago
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer ☆14 Updated last year
- ☆38 Updated 11 months ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation ☆65 Updated 9 months ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023 ☆37 Updated last year
- Code release for LayoutDiffuse ☆50 Updated last year
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation ☆26 Updated 2 weeks ago
- IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model ☆25 Updated last month
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆78 Updated 7 months ago
- Official implementation of Aurora ☆81 Updated last year
- ☆24 Updated last year
- Unofficial implementation of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection ☆31 Updated 2 years ago
- This repository is the official implementation of FLUX-CustomID. It is capable of generating images based on your face image at a level e… ☆12 Updated last week
- Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story generation with strong… ☆20 Updated 3 months ago
- Unofficial implementation of DragDiffusion ☆36 Updated last year
- SOIT: Segmenting Objects with Instance-Aware Transformers ☆14 Updated 2 years ago
- A collection of awesome autoregressive visual generation models ☆42 Updated last week
- A curated list of papers and resources for text-to-image evaluation. ☆26 Updated last year