IDEA-Research / hana
Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in PyTorch
☆17 Updated 2 years ago
Alternatives and similar repositories for hana
Users who are interested in hana are comparing it to the libraries listed below:
- ☆43 Updated last year
- [ICCV 2023] Controllable Person Image Synthesis with Pose‑Constrained Latent Diffusion ☆40 Updated last year
- ☆38 Updated last year
- ☆19 Updated last year
- Code release for LayoutDiffuse ☆53 Updated last year
- The official codes and datasets for Artistic Text Segmentation (ECCV 2024). ☆24 Updated 4 months ago
- Code and dataset for "Detecting Human Artifacts from Text-to-Image Models" ☆15 Updated last month
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion ☆47 Updated 9 months ago
- ReNeg: Learning Negative Embedding with Reward Guidance ☆29 Updated last month
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆79 Updated 10 months ago
- Official implementation of Aurora ☆82 Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 Updated 7 months ago
- Code for FineRewards ☆19 Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models ☆45 Updated last year
- The official code of "Concept-centric Personalization with Large-scale Diffusion Priors". ☆17 Updated last year
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model ☆27 Updated 2 months ago
- ☆11 Updated last year
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation ☆29 Updated 2 months ago
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller ☆33 Updated last week
- ☆14 Updated 3 years ago
- ICCV2023-Diffusion-Papers ☆109 Updated last year
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer ☆14 Updated 2 years ago
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024 ☆37 Updated last year
- One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations. NeurIPS2022. ☆34 Updated 2 years ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation ☆68 Updated last year
- ☆34 Updated 4 months ago
- Teach-DETR: Better Training DETR with Teachers ☆30 Updated 11 months ago
- Video Diffusion State Space Models ☆19 Updated 10 months ago
- SOIT: Segmenting Objects with Instance-Aware Transformers ☆14 Updated 2 years ago
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations ☆26 Updated last year