gaasher / I-JEPA
Implementation of I-JEPA from "Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture"
☆269Updated 3 months ago
Alternatives and similar repositories for I-JEPA:
Users that are interested in I-JEPA are comparing it to the libraries listed below
- Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA☆180Updated last year
- Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch☆407Updated 3 months ago
- An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal …☆360Updated last year
- ☆184Updated last year
- Official code for "TOAST: Transfer Learning via Attention Steering"☆189Updated last year
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"☆336Updated 5 months ago
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆530Updated last year
- Experiments in Joint Embedding Predictive Architectures (JEPAs).☆39Updated last year
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆560Updated last year
- Code release for "Dropout Reduces Underfitting"☆313Updated last year
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Updated 2 years ago
- 🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".☆482Updated last year
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆707Updated last year
- Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆405Updated 8 months ago
- ☆522Updated 2 weeks ago
- This is the repository for the Photorealistic Unreal Graphics (PUG) datasets for representation learning.☆234Updated last year
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆572Updated 2 years ago
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch☆1,238Updated 2 years ago
- Official PyTorch implementation of the paper "In-Context Learning Unlocked for Diffusion Models"☆404Updated last year
- Learning from synthetic data - code and models☆314Updated last year
- ☆201Updated last year
- Official Open Source code for "Scaling Language-Image Pre-training via Masking"☆420Updated 2 years ago
- Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT☆211Updated 8 months ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆314Updated 10 months ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆283Updated 3 weeks ago
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆972Updated last year
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆186Updated last year
- When do we not need larger vision models?☆388Updated 2 months ago
- 🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".☆455Updated last year
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…☆298Updated last year