IDEA-Research / hana
Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in PyTorch
☆17 · Updated 2 years ago
Alternatives and similar repositories for hana:
Users who are interested in hana compare it with the repositories listed below.
- ☆43 · Updated last year
- [ICCV 2023] Controllable Person Image Synthesis with Pose-Constrained Latent Diffusion ☆40 · Updated last year
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation ☆29 · Updated 3 months ago
- The official code for [ACM MM 2022] 'In-N-Out Generative Learning for Dense Unsupervised Video Segmentation'. ☆20 · Updated 2 years ago
- [ICLR 24] MaGIC: Multi-modality Guided Image Completion ☆47 · Updated 10 months ago
- ☆19 · Updated last year
- The official code and datasets for Artistic Text Segmentation (ECCV 2024). ☆25 · Updated 5 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆83 · Updated 7 months ago
- ☆11 · Updated last year
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation ☆68 · Updated last year
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer ☆14 · Updated 2 years ago
- ☆25 · Updated last year
- ☆38 · Updated last year
- Official Repository for "Diffusion HPC: Generate Synthetic Data for Human Mesh Recovery in Challenging Domains" (3DV 2024 Spotlight) ☆43 · Updated 2 years ago
- Unofficial implementation of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection ☆31 · Updated 2 years ago
- ☆19 · Updated last year
- Teach-DETR: Better Training DETR with Teachers ☆30 · Updated 11 months ago
- ☆21 · Updated last year
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition. ☆13 · Updated last year
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model ☆27 · Updated 3 months ago
- [ICCV 2021] Official PyTorch Code for "Online Knowledge Distillation for Efficient Pose Estimation" ☆43 · Updated last year
- ☆14 · Updated 3 years ago
- The official code of "Concept-centric Personalization with Large-scale Diffusion Priors". ☆18 · Updated last year
- DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models ☆46 · Updated last year
- An official PyTorch implementation of "MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts" ☆29 · Updated 3 months ago
- Code for FineRewards ☆19 · Updated last year
- MCPL: Multi-Concept Prompt Learning ☆20 · Updated 9 months ago
- ReNeg: Learning Negative Embedding with Reward Guidance ☆31 · Updated 2 months ago