facebookresearch / ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
☆2,994 · Updated last year
Alternatives and similar repositories for ijepa
Users who are interested in ijepa are comparing it to the libraries listed below.
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,086 · Updated 3 months ago
- An open-source framework for training large multimodal models. ☆3,952 · Updated 9 months ago
- PyTorch code and models for the DINOv2 self-supervised learning method. ☆10,845 · Updated last week
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ☆5,883 · Updated last year
- [NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once" ☆4,624 · Updated 10 months ago
- Painter & SegGPT Series: Vision Foundation Models from BAAI ☆2,572 · Updated 6 months ago
- Multimodal-GPT ☆1,502 · Updated 2 years ago
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. ☆2,950 · Updated last month
- Hiera: A fast, powerful, and simple hierarchical vision transformer. ☆994 · Updated last year
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" ☆7,449 · Updated last year
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,073 · Updated 9 months ago
- Meta-Transformer for Unified Multimodal Learning ☆1,608 · Updated last year
- Foundation Architecture for (M)LLMs ☆3,084 · Updated last year
- This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural R… ☆1,934 · Updated last year
- Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B). ☆2,246 · Updated 2 years ago
- Official repo for consistency models. ☆6,360 · Updated last year
- Segment Anything in High Quality [NeurIPS 2023] ☆3,990 · Updated 6 months ago
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch ☆2,140 · Updated 6 months ago
- ImageBind One Embedding Space to Bind Them All ☆8,693 · Updated 10 months ago
- An Open-source Toolkit for LLM Development ☆2,782 · Updated 5 months ago
- EVA Series: Visual Representation Fantasies from BAAI ☆2,511 · Updated 10 months ago
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch ☆1,248 · Updated 2 years ago
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity" ☆2,642 · Updated 11 months ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆963 · Updated last year
- 4M: Massively Multimodal Masked Modeling ☆1,735 · Updated 2 weeks ago
- Consistency Distilled Diff VAE ☆2,189 · Updated last year
- Schedule-Free Optimization in PyTorch ☆2,179 · Updated last month
- ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Expert… ☆1,460 · Updated 3 months ago
- ☆1,703 · Updated 8 months ago
- Scenic: A Jax Library for Computer Vision Research and Beyond ☆3,566 · Updated last week