xplip / pixelLinks
Research code for pixel-based encoders of language (PIXEL)
☆339Updated 2 months ago
Alternatives and similar repositories for pixel
Users that are interested in pixel are comparing it to the libraries listed below
Sorting:
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint☆412Updated last year
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆206Updated last year
- 🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".☆481Updated last year
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)☆372Updated 2 years ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆457Updated 2 years ago
- Language Models Can See: Plugging Visual Controls in Text Generation☆259Updated 3 years ago
- ☆130Updated 3 years ago
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training☆168Updated 2 years ago
- Big-Interleaved-Dataset☆57Updated 2 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆314Updated 2 years ago
- Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models☆142Updated 3 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities☆78Updated 3 years ago
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)☆158Updated 2 years ago
- Open source code for AAAI 2023 Paper "BridgeTower: Building Bridges Between Encoders in Vision-Language Representation Learning"☆166Updated 2 years ago
- MERLOT: Multimodal Neural Script Knowledge Models☆224Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆138Updated 2 years ago
- Code for the ALiBi method for transformer language models (ICLR 2022)☆543Updated last year
- This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.1…☆134Updated 2 years ago
- M4 experiment logbook☆58Updated 2 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆402Updated 2 months ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆96Updated 6 months ago
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated 2 years ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch☆230Updated last year
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆319Updated last year
- CLIP (Contrastive Language–Image Pre-training) for Italian☆185Updated 2 years ago
- Sequence modeling with Mega.☆300Updated 2 years ago
- MEND: Fast Model Editing at Scale☆250Updated 2 years ago
- ☆16Updated 2 years ago
- DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)☆142Updated 3 months ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆144Updated 3 years ago