Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation
☆107Feb 26, 2025Updated last year
Alternatives and similar repositories for m3ae_public
Users that are interested in m3ae_public are comparing it to the libraries listed below
Sorting:
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆32Oct 26, 2022Updated 3 years ago
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning☆30Jan 26, 2023Updated 3 years ago
- ☆14Nov 28, 2022Updated 3 years ago
- PyTorch implementation of the Hiveformer research paper☆48Jun 27, 2023Updated 2 years ago
- PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆39Jan 10, 2023Updated 3 years ago
- Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"☆16Apr 13, 2021Updated 4 years ago
- ☆18Mar 10, 2026Updated last week
- [ICRA 2020] Implementation of Adversarial Skill Networks for learning reusable and composable skills from unlabeled videos.☆20Oct 3, 2023Updated 2 years ago
- Standalone library of frequently-used wrappers for dm_env environments.☆19Jul 9, 2024Updated last year
- Hierarchical Universal Language Conditioned Policies☆77Mar 19, 2024Updated 2 years ago
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…☆16Jun 28, 2023Updated 2 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆13Nov 4, 2023Updated 2 years ago
- ☆27Mar 6, 2025Updated last year
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- ☆99Dec 2, 2025Updated 3 months ago
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- running LayoutLMv2☆11Apr 27, 2022Updated 3 years ago
- Understanding Self-Supervised Learning in a non-IID Setting☆21Oct 21, 2022Updated 3 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Interactive Fleet Learning Benchmark☆37May 18, 2023Updated 2 years ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- ☆10May 24, 2021Updated 4 years ago
- "What is Learned in Visually Grounded Neural Syntax Acquisition", Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi (AC…☆12Dec 30, 2021Updated 4 years ago
- ☆60Apr 16, 2023Updated 2 years ago
- RareAct: A video dataset of unusual interactions☆33Aug 4, 2020Updated 5 years ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆18Apr 11, 2025Updated 11 months ago
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- The official implementation of Self-aware Object Detection [CVPR 2023]☆13Jun 30, 2023Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆677Aug 14, 2024Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆76May 27, 2023Updated 2 years ago
- Conservative Q learning in Jax☆57Feb 7, 2023Updated 3 years ago
- [MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.☆130Sep 16, 2022Updated 3 years ago
- ☆231Dec 18, 2023Updated 2 years ago
- [CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"☆20Apr 21, 2024Updated last year
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆45Oct 15, 2023Updated 2 years ago
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation☆459Dec 2, 2024Updated last year
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).☆21Mar 4, 2023Updated 3 years ago
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆22Dec 27, 2022Updated 3 years ago
- ☆16Apr 10, 2022Updated 3 years ago