gaasher / I-JEPALinks

Implementation of I-JEPA from "Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture"

☆272

Alternatives and similar repositories for I-JEPA

Users that are interested in I-JEPA are comparing it to the libraries listed below

Sorting:

fabawi / ImageBind-LoRA
Fine-tuning "ImageBind One Embedding Space to Bind Them All" with LoRA
☆184Updated last year
facebookresearch / hiera
Hiera: A fast, powerful, and simple hierarchical vision transformer.
☆994Updated last year
lucidrains / recurrent-memory-transformer-pytorch
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
☆409Updated 5 months ago
bfshi / TOAST
Official code for "TOAST: Transfer Learning via Attention Steering"
☆189Updated last year
Haiyang-W / TokenFormer
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
☆562Updated 4 months ago
facebookresearch / dropout
Code release for "Dropout Reduces Underfitting"
☆313Updated 2 years ago
kyegomez / zeta
Build high-performance AI models with modular building blocks
☆528Updated 2 weeks ago
allenai / unified-io-2
☆614Updated last year
lucidrains / block-recurrent-transformer-pytorch
Implementation of Block Recurrent Transformer - Pytorch
☆219Updated 10 months ago
gstoica27 / ZipIt
A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…
☆300Updated last year
lucidrains / simple-hierarchical-transformer
Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT
☆214Updated 10 months ago
lucidrains / soft-moe-pytorch
Implementation of Soft MoE, proposed by Brain's Vision team, in Pytorch
☆298Updated 2 months ago
hamidkazemi22 / vit-visualization
☆184Updated last year
minyoungg / platonic-rep
☆567Updated 2 months ago
mlfoundations / model-soups
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
☆470Updated 11 months ago
lucidrains / lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
☆277Updated 10 months ago
kyegomez / Sophia
Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
☆381Updated last year
kyegomez / Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
☆171Updated 2 months ago
LumenPallidium / jepa
Experiments in Joint Embedding Predictive Architectures (JEPAs).
☆40Updated last year
wpeebles / G.pt
Official PyTorch Implementation of "Learning to Learn with Generative Models of Neural Network Checkpoints"
☆341Updated 2 years ago
lucidrains / flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
☆1,248Updated 2 years ago
facebookresearch / LaViLa
Code release for "Learning Video Representations from Large Language Models"
☆524Updated last year
internet-explorer-ssl / internet-explorer
Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…
☆163Updated 2 years ago
lucidrains / mirasol-pytorch
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
☆89Updated last year
bfshi / scaling_on_scales
When do we not need larger vision models?
☆395Updated 4 months ago
mshukor / UnIVAL
[TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.
☆228Updated last year
kyegomez / CM3Leon
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal …
☆361Updated last year
lucidrains / parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
☆533Updated last year
penghao-wu / vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
☆629Updated last year
facebookresearch / flip
Official Open Source code for "Scaling Language-Image Pre-training via Masking"
☆426Updated 2 years ago