XueFuzhao / HowToRunScenic
☆13Updated 2 years ago
Alternatives and similar repositories for HowToRunScenic:
Users that are interested in HowToRunScenic are comparing it to the libraries listed below
- Official repository for the General Robust Image Task (GRIT) Benchmark☆51Updated last year
- Paper List for In-context Learning 🌷☆20Updated 2 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆64Updated 2 years ago
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".☆78Updated 2 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆31Updated last year
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆99Updated 2 years ago
- CCVS: Context-aware Controllable Video Synthesis☆22Updated 3 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆13Updated last month
- Command-line tool for downloading and extending the RedCaps dataset.☆46Updated last year
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆35Updated 8 months ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆64Updated 8 months ago
- ☆69Updated 7 months ago
- ☆64Updated last year
- A Video Tokenizer Evaluation Dataset☆101Updated last month
- ☆23Updated last year
- ☆38Updated 2 years ago
- https://arxiv.org/abs/2209.15162☆49Updated 2 years ago
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆68Updated 3 years ago
- Release of ImageNet-Captions☆45Updated 2 years ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆90Updated last year
- ☆58Updated last year
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆25Updated last week
- ☆23Updated 2 weeks ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆18Updated 2 months ago
- ☆50Updated 2 years ago
- ☆41Updated last year
- A list of papers and other resources on language-guided image editing.☆38Updated 4 years ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆16Updated 4 months ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆43Updated 8 months ago