xplip / pixel
Research code for pixel-based encoders of language (PIXEL)
☆335Updated last year
Alternatives and similar repositories for pixel
Users that are interested in pixel are comparing it to the libraries listed below
Sorting:
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint☆387Updated last year
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆451Updated last year
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆313Updated last year
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch☆227Updated 8 months ago
- Language Models Can See: Plugging Visual Controls in Text Generation☆256Updated 2 years ago
- ☆128Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆136Updated last year
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆202Updated 8 months ago
- Code for the ALiBi method for transformer language models (ICLR 2022)☆526Updated last year
- 🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".☆483Updated last year
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training☆167Updated 2 years ago
- Sequence modeling with Mega.☆295Updated 2 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorch☆117Updated 3 years ago
- MERLOT: Multimodal Neural Script Knowledge Models☆224Updated 3 years ago
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)☆157Updated 2 years ago
- Code release for "Dropout Reduces Underfitting"☆313Updated 2 years ago
- FairSeq repo with Apollo optimizer☆114Updated last year
- Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models☆142Updated 2 years ago
- MEND: Fast Model Editing at Scale☆245Updated last year
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆44Updated 2 years ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆463Updated 2 years ago
- DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)☆54Updated 2 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound"☆140Updated 2 years ago
- Evaluation pipeline for the BabyLM Challenge 2023.☆75Updated last year
- ☆182Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated last year
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆783Updated 11 months ago
- Understanding the Difficulty of Training Transformers☆329Updated 2 years ago
- Search Engines with Autoregressive Language models☆285Updated 2 years ago
- Conceptual 12M is a dataset containing (image-URL, caption) pairs collected for vision-and-language pre-training.☆390Updated 2 years ago