AMD-AGI / Instella-T2I
☆24 Updated 5 months ago
Alternatives and similar repositories for Instella-T2I
Users who are interested in Instella-T2I are comparing it to the libraries listed below.
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129) ☆91 Updated 2 years ago
- ☆113 Updated last year
- Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kort… ☆134 Updated 9 months ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets" ☆22 Updated last year
- The official implementation of "MLLMs-Augmented Visual-Language Representation Learning" ☆31 Updated last year
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023) ☆32 Updated 2 years ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆89 Updated last year
- Paper List for In-context Learning 🌷 ☆20 Updated 2 years ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958) ☆44 Updated 2 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping ☆97 Updated 9 months ago
- Benchmarking Attention Mechanism in Vision Transformers. ☆19 Updated 3 years ago
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect… ☆76 Updated last year
- VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation ☆86 Updated last year
- [ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments ☆21 Updated 4 months ago
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604… ☆83 Updated 3 years ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described … ☆69 Updated 4 years ago
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data". ☆78 Updated 2 years ago