LAION-AI / Conditional-Pretraining-of-Large-Language-ModelsLinks
☆37Updated 2 years ago
Alternatives and similar repositories for Conditional-Pretraining-of-Large-Language-Models
Users that are interested in Conditional-Pretraining-of-Large-Language-Models are comparing it to the libraries listed below
Sorting:
- Directed masked autoencoders☆14Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated 2 years ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated 2 years ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19Updated 2 years ago
- ☆65Updated last year
- Code for T-MARS data filtering☆35Updated 2 years ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆86Updated 2 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Updated last year
- ☆26Updated last year
- Un-*** 50 billions multimodality dataset☆22Updated 3 years ago
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆38Updated 2 years ago
- Patching open-vocabulary models by interpolating weights☆91Updated 2 years ago
- ☆29Updated 3 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆102Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Updated last year
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated 2 years ago
- Source code for the paper "Prefix Language Models are Unified Modal Learners"☆42Updated 2 years ago
- Official repository for the General Robust Image Task (GRIT) Benchmark☆54Updated 2 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆17Updated last year
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- ☆27Updated 2 years ago
- ☆26Updated 3 years ago
- ☆32Updated 3 years ago
- Paper List for In-context Learning 🌷☆20Updated 2 years ago
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Updated last year
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.☆48Updated 2 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 4 years ago
- ☆50Updated last year