LAION-AI / Conditional-Pretraining-of-Large-Language-Models
☆37 · Updated last year
Related projects:
- ImageNet-12k subset of ImageNet-21k (fall11) ☆19 · Updated last year
- ☆21 · Updated this week
- ☆18 · Updated 3 weeks ago
- Directed masked autoencoders ☆13 · Updated last year
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 ☆49 · Updated 2 years ago
- ☆24 · Updated 3 years ago
- ☆29 · Updated last year
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context … ☆20 · Updated 6 months ago
- Using FlexAttention to compute attention with different masking patterns ☆28 · Updated last week
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆15 · Updated 10 months ago
- ☆32 · Updated 2 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model ☆15 · Updated 5 months ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch ☆35 · Updated 2 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021) ☆18 · Updated last year
- Source code for the paper "Prefix Language Models are Unified Modal Learners" ☆42 · Updated last year
- Implementation of the Remixer Block from the Remixer paper, in Pytorch ☆36 · Updated 2 years ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene… ☆29 · Updated 2 years ago
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details. ☆20 · Updated last week
- ☆20 · Updated last year
- ViT trained on COYO-Labeled-300M dataset ☆29 · Updated last year
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719 ☆18 · Updated 3 months ago
- Rethinking Nearest Neighbors for Visual Classification ☆31 · Updated 2 years ago
- ☆26 · Updated last year
- ☆15 · Updated last year
- ☆29 · Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models ☆40 · Updated 3 months ago
- ☆24 · Updated 7 months ago
- Stay tuned! ☆11 · Updated 5 months ago
- Research code for "Training Vision-Language Transformers from Captions Alone" ☆34 · Updated 2 years ago