Multimodal Masked Autoencoders (M3AE): A JAX/Flax Implementation
☆108Feb 26, 2025Updated last year
Alternatives and similar repositories for m3ae_public
Users that are interested in m3ae_public are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Instruction Following Agents with Multimodal Transforemrs☆54Nov 3, 2022Updated 3 years ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆32Oct 26, 2022Updated 3 years ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022☆623Dec 13, 2022Updated 3 years ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax☆160Nov 1, 2022Updated 3 years ago
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning☆31Jan 26, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆14Nov 28, 2022Updated 3 years ago
- Code Release for "On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies"☆16Apr 13, 2021Updated 4 years ago
- ☆18Mar 18, 2026Updated 3 weeks ago
- [ICRA 2020] Implementation of Adversarial Skill Networks for learning reusable and composable skills from unlabeled videos.☆21Oct 3, 2023Updated 2 years ago
- Hierarchical Universal Language Conditioned Policies☆78Mar 19, 2024Updated 2 years ago
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…☆16Jun 28, 2023Updated 2 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- ☆54Jan 20, 2023Updated 3 years ago
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆40Nov 11, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- ☆19Dec 24, 2021Updated 4 years ago
- Official Repository for VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning☆25Apr 12, 2024Updated 2 years ago
- running LayoutLMv2☆11Apr 27, 2022Updated 3 years ago
- Understanding Self-Supervised Learning in a non-IID Setting☆21Oct 21, 2022Updated 3 years ago
- ☆101Dec 2, 2025Updated 4 months ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- "What is Learned in Visually Grounded Neural Syntax Acquisition", Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi (AC…☆12Dec 30, 2021Updated 4 years ago
- RareAct: A video dataset of unusual interactions☆34Aug 4, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆18Apr 11, 2025Updated last year
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos.☆678Aug 14, 2024Updated last year
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆77May 27, 2023Updated 2 years ago
- [MICCAI-2022] This is the official implementation of Multi-Modal Masked Autoencoders for Medical Vision-and-Language Pre-Training.☆130Sep 16, 2022Updated 3 years ago
- [CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"☆20Apr 21, 2024Updated last year
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆45Oct 15, 2023Updated 2 years ago
- PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).☆21Mar 4, 2023Updated 3 years ago
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation☆460Dec 2, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code and data for the project "Visually grounded continual learning of compositional semantics"☆22Dec 27, 2022Updated 3 years ago
- ICCV 2021☆34May 11, 2022Updated 3 years ago
- ☆16Apr 10, 2022Updated 4 years ago
- ☆80Dec 9, 2022Updated 3 years ago
- paper on dexpilot☆15Oct 14, 2019Updated 6 years ago
- PyTorch implementation of MAE https//arxiv.org/abs/2111.06377☆8,262Jul 23, 2024Updated last year
- Directed masked autoencoders☆14Mar 25, 2026Updated 2 weeks ago