young-geng/m3ae_public

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/young-geng/m3ae_public)

young-geng / m3ae_public

Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation

☆110

Alternatives and similar repositories for m3ae_public

Users that are interested in m3ae_public are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

haoliuhl / instructrl
View on GitHub
Instruction Following Agents with Multimodal Transforemrs
☆54Nov 3, 2022Updated 3 years ago
EPFL-VILAB / MultiMAE
View on GitHub
MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022
☆632Dec 13, 2022Updated 3 years ago
ErickRosete / tacorl
View on GitHub
TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning
☆32Jan 26, 2023Updated 3 years ago
XueFuzhao / HowToRunScenic
View on GitHub
☆14Nov 28, 2022Updated 3 years ago
vlc-robot / hiveformer-corl
View on GitHub
PyTorch implementation of the Hiveformer research paper
☆48Jun 27, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bwconrad / can
View on GitHub
PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".
☆39Jan 10, 2023Updated 3 years ago
tatsu-lab / mlm_inductive_bias
View on GitHub
Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"
☆16Apr 13, 2021Updated 5 years ago
brentyi / transformer-exercises-jax
View on GitHub
☆18Apr 17, 2026Updated 3 months ago
kevinzakka / dm_env_wrappers
View on GitHub
Standalone library of frequently-used wrappers for dm_env environments.
☆19Jul 9, 2024Updated 2 years ago
lukashermann / hulc
View on GitHub
Hierarchical Universal Language Conditioned Policies
☆78Mar 19, 2024Updated 2 years ago
ikostrikov / jaxrl2
View on GitHub
☆58Jan 20, 2023Updated 3 years ago
minnesotanlp / infoVerse
View on GitHub
Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…
☆16Jun 28, 2023Updated 3 years ago
dmoltisanti / air-cvpr23
View on GitHub
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13May 25, 2023Updated 3 years ago
BerkeleyAutomation / ifl_benchmark
View on GitHub
Interactive Fleet Learning Benchmark
☆39May 18, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
vikashplus / unitree_sim
View on GitHub
MuJoCo models for Unitree Robots
☆12Nov 24, 2021Updated 4 years ago
liruiw / Dec-SSL
View on GitHub
Understanding Self-Supervised Learning in a non-IID Setting
☆21Oct 21, 2022Updated 3 years ago
maxreciprocate / offline
View on GitHub
Offline RL experiments
☆15Oct 1, 2022Updated 3 years ago
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated 2 years ago
gary23ai / awesome_concept_learning_list
View on GitHub
A curated list of papers & resources linked to concept learning
☆13Aug 9, 2023Updated 2 years ago
antoine77340 / RareAct
View on GitHub
RareAct: A video dataset of unusual interactions
☆35Aug 4, 2020Updated 5 years ago
PeixianChen / MEDet
View on GitHub
☆23Dec 23, 2022Updated 3 years ago
fiveai / saod
View on GitHub
The official implementation of Self-aware Object Detection [CVPR 2023]
☆13Jun 30, 2023Updated 3 years ago
lil-lab / vgnsl_analysis
View on GitHub
"What is Learned in Visually Grounded Neural Syntax Acquisition", Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi (AC…
☆12Dec 30, 2021Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
younggyoseo / MV-MWM
View on GitHub
☆61Apr 16, 2023Updated 3 years ago
UCSC-VLAA / AdvXL
View on GitHub
[CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"
☆20Apr 21, 2024Updated 2 years ago
m-bain / webvid
View on GitHub
Large-scale text-video dataset. 10 million captioned short videos.
☆685Aug 14, 2024Updated last year
zhjohnchan / M3AE
View on GitHub
[MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.
☆134Sep 16, 2022Updated 3 years ago
StanfordMIMI / villa
View on GitHub
[ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data
☆45Oct 15, 2023Updated 2 years ago
geekyutao / TaskRes
View on GitHub
Task Residual for Tuning Vision-Language Models (CVPR 2023)
☆76May 27, 2023Updated 3 years ago
allenai / unified-io-inference
View on GitHub
☆231Dec 18, 2023Updated 2 years ago
ronghanghu / moco_v3_tpu
View on GitHub
☆16Apr 10, 2022Updated 4 years ago
twitter / diffusion-rl
View on GitHub
☆80Dec 9, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
RunpeiDong / DreamLLM
View on GitHub
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
☆462Dec 2, 2024Updated last year
ahmedssabir / Textual-Visual-Semantic-Dataset-for-Text-Spotting
View on GitHub
Textual Visual Semantic Dataset for Text Spotting. CVPRW 2020
☆12Jul 2, 2022Updated 4 years ago
INK-USC / VisCOLL
View on GitHub
Code and data for the project "Visually grounded continual learning of compositional semantics"
☆22Dec 27, 2022Updated 3 years ago
zejiangh / MILAN
View on GitHub
PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…
☆84Aug 16, 2022Updated 3 years ago
jason9693 / FROZEN
View on GitHub
☆14May 3, 2022Updated 4 years ago
facebookresearch / mae
View on GitHub
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
☆8,366Jul 23, 2024Updated last year
jramapuram / LifelongVAE
View on GitHub
Lifelong Variational Autoencoder
☆15Dec 6, 2017Updated 8 years ago