AMD-AGI / Instella-T2I
☆24 Updated 4 months ago
Alternatives and similar repositories for Instella-T2I
Users who are interested in Instella-T2I are comparing it to the libraries listed below.
- Introduction and scripts for the paper "PartImageNet: A Large, High-Quality Dataset of Parts" (Ju He, Shuo Yang, Shaokang Yang, Adam Kort… ☆134 Updated 8 months ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129) ☆90 Updated 2 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets" ☆22 Updated last year
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023) ☆32 Updated 2 years ago
- [NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models". ☆38 Updated last year
- ☆113 Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆88 Updated last year
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping ☆98 Updated 8 months ago
- Test-Time Training on Video Streams ☆64 Updated 2 years ago
- [ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments ☆21 Updated 3 months ago
- [CVPR 2022] Official repository of AdaFocusV2. ☆90 Updated 11 months ago
- Compress conventional Vision-Language Pre-training data ☆52 Updated 2 years ago
- PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurIPS 2024. ☆31 Updated last year
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation ☆34 Updated last year
- ☆12 Updated 2 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders ☆57 Updated last year
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958) ☆44 Updated 2 years ago
- [NeurIPS 2024] Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning ☆70 Updated 9 months ago
- ☆27 Updated 3 years ago
- Paper List for In-context Learning 🌷 ☆20 Updated 2 years ago
- ☆59 Updated 3 years ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day" ☆136 Updated 3 months ago
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect… ☆72 Updated last year
- ☆21 Updated 10 months ago
- The official implementation of 《MLLMs-Augmented Visual-Language Representation Learning》 ☆31 Updated last year
- Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M… ☆43 Updated last year
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604… ☆83 Updated 3 years ago
- ☆27 Updated 8 months ago
- PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613) ☆99 Updated 3 years ago
- CaptionQA: Is Your Caption as Useful as the Image Itself? ☆19 Updated this week