gaasher / I-JEPA
Implementation of I-JEPA from "Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture"
☆265Updated 2 months ago
Alternatives and similar repositories for I-JEPA:
Users that are interested in I-JEPA are comparing it to the libraries listed below
- Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA☆180Updated last year
- Official code for "TOAST: Transfer Learning via Attention Steering"☆189Updated last year
- ☆183Updated last year
- Experiments in Joint Embedding Predictive Architectures (JEPAs).☆40Updated last year
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…☆297Updated last year
- Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch☆407Updated 2 months ago
- [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"☆312Updated 9 months ago
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch☆1,235Updated 2 years ago
- Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch☆271Updated 11 months ago
- Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.☆185Updated last year
- Hiera: A fast, powerful, and simple hierarchical vision transformer.☆961Updated last year
- Implementation of Block Recurrent Transformer - Pytorch☆218Updated 7 months ago
- A method to increase the speed and lower the memory footprint of existing vision transformers.☆1,029Updated 9 months ago
- ☆201Updated last year
- Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT☆210Updated 7 months ago
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"☆331Updated 4 months ago
- Official PyTorch Implementation of "Learning to Learn with Generative Models of Neural Network Checkpoints"☆339Updated 2 years ago
- Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)☆902Updated last year
- Official Open Source code for "Scaling Language-Image Pre-training via Masking"☆417Updated last year
- Code release for "Learning Video Representations from Large Language Models"☆512Updated last year
- Official code for VisProg (CVPR 2023 Best Paper!)☆712Updated 7 months ago
- Robust fine-tuning of zero-shot models☆683Updated 2 years ago
- VICRegL official code base☆226Updated 2 years ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Updated 2 years ago
- Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…☆226Updated last year
- Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time☆452Updated 8 months ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆708Updated last year
- Code release for "Dropout Reduces Underfitting"☆312Updated last year
- DataComp: In search of the next generation of multimodal datasets☆688Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆88Updated last year