saicoco / webdataset
pytorch大规模数据读取dataset
☆10Updated 2 years ago
Related projects: ⓘ
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch☆17Updated last year
- ☆17Updated last month
- ☆44Updated last year
- ☆72Updated 8 months ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆24Updated 5 months ago
- ☆24Updated last year
- [ACM MM 2024] Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization☆10Updated last month
- Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis (CVPR2022)☆94Updated 2 years ago
- Implementation code:Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models☆50Updated 8 months ago
- ☆41Updated this week
- ☆19Updated last year
- [ICCV 2023] Controllable Person Image Synthesis with Pose‑Constrained Latent Diffusion☆36Updated 11 months ago
- Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning☆58Updated 3 months ago
- [ECCV2024] Towards Reliable Advertising Image Generation Using Human Feedback☆29Updated last month
- Landmark Deep Equilibrium Model (LDEQ), applied to videos with a Recurrence without Recurrence (RwR) paradigm☆36Updated last year
- ☆15Updated this week
- ☆52Updated last year
- A paper list of some recent works about Token Compress for Vit and VLM☆32Updated last week
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆75Updated 2 months ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆37Updated last year
- ☆52Updated last month
- ☆104Updated 5 months ago
- IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆18Updated last week
- ☆20Updated 9 months ago
- ☆13Updated last month
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆62Updated 7 months ago
- A curated list of papers, code, and resources pertaining to generative image composition or object insertion.☆74Updated 2 months ago
- An official implementation of "Hulk: A Universal Knowledge Translator for Human-Centric Tasks"☆83Updated 3 months ago
- Sora Generates Videos with Stunning Geometrical Consistency☆46Updated 5 months ago
- ☆56Updated last year