Research code for pixel-based encoders of language (PIXEL)
β345Jul 15, 2025Updated 9 months ago
Alternatives and similar repositories for pixel
Users that are interested in pixel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains an extension of fairseq for pixel / visual representations of text for machine translation.β37Feb 2, 2024Updated 2 years ago
- π VITRina: VIsual Token Representationsβ11Jun 15, 2023Updated 2 years ago
- β85Dec 4, 2022Updated 3 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-trainingβ790Feb 9, 2023Updated 3 years ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"β36Jun 7, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Omnivore: A Single Model for Many Visual Modalitiesβ573Nov 12, 2022Updated 3 years ago
- Official pytorch implementation of I2I translation with low resolution conditioningβ23Sep 2, 2021Updated 4 years ago
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ80Jan 7, 2026Updated 3 months ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"β414Mar 25, 2024Updated 2 years ago
- Language Models Can See: Plugging Visual Controls in Text Generationβ259Jun 1, 2022Updated 3 years ago
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023)β34Feb 5, 2023Updated 3 years ago
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".β317Aug 7, 2023Updated 2 years ago
- Textual Visual Semantic Dataset for Text Spotting. CVPRW 2020β12Jul 2, 2022Updated 3 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semantiβ¦β21Jul 11, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)β942Nov 7, 2023Updated 2 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.β90Sep 12, 2024Updated last year
- Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)β463May 9, 2022Updated 3 years ago
- β88Jan 10, 2024Updated 2 years ago
- β43Aug 9, 2022Updated 3 years ago
- Patching open-vocabulary models by interpolating weightsβ91Sep 28, 2023Updated 2 years ago
- Code for paper βLanguage Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Abilityββ15Jun 13, 2023Updated 2 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorchβ76Dec 4, 2022Updated 3 years ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learningβ2,811Mar 21, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- β19Updated this week
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.β3,430May 19, 2025Updated 11 months ago
- PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)β372Jul 29, 2023Updated 2 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifiβ¦β31Dec 5, 2022Updated 3 years ago
- β12Mar 12, 2023Updated 3 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".β13Sep 17, 2021Updated 4 years ago
- Paper List for In-context Learning π·β19Jan 3, 2023Updated 3 years ago
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigmβ675Sep 19, 2022Updated 3 years ago
- Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Lβ¦β2,558Apr 24, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)β211Dec 18, 2022Updated 3 years ago
- Exploring Visual Prompts for Adapting Large-Scale Modelsβ289Jun 6, 2022Updated 3 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"β15Aug 30, 2023Updated 2 years ago
- Code for pre-training CharacterBERT models (as well as BERT models).β34Sep 6, 2021Updated 4 years ago
- Datasets for compositional learningβ11Nov 28, 2018Updated 7 years ago
- ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Modelsβ341Feb 17, 2024Updated 2 years ago
- Code for the Globetrotter projectβ23Mar 17, 2022Updated 4 years ago