Official Implementation of CAPEAM (ICCV'23)
☆16 · Nov 30, 2024 · Updated last year
Alternatives and similar repositories for capeam
Users that are interested in capeam are comparing it to the libraries listed below.
- Prompter for Embodied Instruction Following ☆18 · Nov 30, 2023 · Updated 2 years ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll… ☆40 · Jun 21, 2024 · Updated last year
- Visual Relationship Understanding ☆10 · Oct 2, 2021 · Updated 4 years ago
- Official Implementation of ReALFRED (ECCV'24) ☆45 · Oct 11, 2024 · Updated last year
- Official repository of ICLR 2022 paper FILM: Following Instructions in Language with Modular Methods ☆127 · Apr 9, 2023 · Updated 3 years ago
- Official Implementation of CL-ALFRED (ICLR'24) ☆31 · Oct 24, 2024 · Updated last year
- [ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language… ☆11 · Apr 4, 2026 · Updated last week
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling ☆13 · May 5, 2025 · Updated 11 months ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects" ☆19 · May 2, 2025 · Updated 11 months ago
- The code for 'COVID-19 Lung Infection Segmentation with A Novel Two-Stage Cross-Domain Transfer Learning Framework' ☆11 · Aug 16, 2021 · Updated 4 years ago
- A mini-framework for running AI2-Thor with Docker. ☆37 · Apr 26, 2024 · Updated last year
- Yet another RL Baseline repo. ☆13 · May 28, 2024 · Updated last year
- General-purpose Visual Understanding Evaluation ☆20 · Dec 21, 2023 · Updated 2 years ago
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models ☆219 · Mar 26, 2025 · Updated last year
- HEtero-Assists Distillation for Heterogeneous Object Detectors ☆10 · Jul 3, 2023 · Updated 2 years ago
- ☆10 · Sep 12, 2024 · Updated last year
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents. ☆282 · Apr 8, 2026 · Updated last week
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024) ☆16 · Mar 29, 2024 · Updated 2 years ago
- ☆52 · Jan 9, 2025 · Updated last year
- Code for using the Grasp Affordance Reasoning dataset ☆10 · Sep 17, 2019 · Updated 6 years ago
- CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes ☆12 · Sep 2, 2024 · Updated last year
- [TMM 2023] Language-Guided Face Animation by Recurrent StyleGAN-based Generator ☆11 · Apr 23, 2023 · Updated 2 years ago
- This code is submitted to ICCV Workshop 2017: Fake vs. true facial emotion recognition competition ☆11 · Oct 17, 2017 · Updated 8 years ago
- The implementation of FINER-MLLM, which is accepted by MM2024. ☆18 · Oct 8, 2024 · Updated last year
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations ☆22 · Dec 24, 2025 · Updated 3 months ago
- This is the official code repository for "MEW-UNet: Multi-axis representation learning in frequency domain for medical image segmentation… ☆31 · Nov 8, 2022 · Updated 3 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning ☆14 · Dec 30, 2024 · Updated last year
- Official Implementation of ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25) ☆23 · Nov 25, 2025 · Updated 4 months ago
- ☆34 · May 27, 2023 · Updated 2 years ago
- MimicDroid: In-Context Learning for Humanoid Robot Manipulation from Human Play Videos ☆45 · Feb 10, 2026 · Updated 2 months ago
- ☆18 · Jan 19, 2026 · Updated 2 months ago
- Disentangled 3D face animation generator ☆21 · Feb 21, 2023 · Updated 3 years ago
- Home of the Robot Common Sense Embedding ☆10 · Sep 18, 2021 · Updated 4 years ago
- Repository for Skill Set Optimization ☆14 · Jul 26, 2024 · Updated last year
- [ICML 2024] RAUCA: A robust and accurate adversarial camouflage generation method ☆25 · Nov 29, 2025 · Updated 4 months ago
- [CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation" ☆20 · Apr 18, 2025 · Updated 11 months ago
- ☆20 · Nov 4, 2023 · Updated 2 years ago
- Host CIFAR-10.2 Data Set ☆13 · Sep 22, 2021 · Updated 4 years ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p… ☆20 · Jul 20, 2025 · Updated 8 months ago