Official Implementation of CAPEAM (ICCV'23)
☆16Nov 30, 2024Updated last year
Alternatives and similar repositories for capeam
Users that are interested in capeam are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Prompter for Embodied Instruction Following☆18Nov 30, 2023Updated 2 years ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…☆40Jun 21, 2024Updated last year
- Visual Relationship Understanding☆10Oct 2, 2021Updated 4 years ago
- Official Implementation of ReALFRED (ECCV'24)☆45Oct 11, 2024Updated last year
- Official repository of ICLR 2022 paper FILM: Following Instructions in Language with Modular Methods☆127Apr 9, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official Implementation of CL-ALFRED (ICLR'24)☆31Oct 24, 2024Updated last year
- [ICML 2024] RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models☆11Jun 30, 2025Updated 8 months ago
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling☆12May 5, 2025Updated 10 months ago
- The code of the paper "DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects"☆19May 2, 2025Updated 10 months ago
- The code for 'COVID-19 Lung Infection Segmentation with A Novel Two-Stage Cross-Domain Transfer Learning Framework'☆11Aug 16, 2021Updated 4 years ago
- A mini-framework for running AI2-Thor with Docker.☆37Apr 26, 2024Updated last year
- Yet another RL Baseline repo.☆13May 28, 2024Updated last year
- General-purpose Visual Understanding Evaluation☆20Dec 21, 2023Updated 2 years ago
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models☆218Mar 26, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- HEtero-Assists Distillation for Heterogeneous Object Detectors☆10Jul 3, 2023Updated 2 years ago
- ☆10Sep 12, 2024Updated last year
- [ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.☆273Feb 20, 2026Updated last month
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated last year
- ☆50Jan 9, 2025Updated last year
- Code for using the Grasp Affordance Reasoning dataset☆10Sep 17, 2019Updated 6 years ago
- CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes☆12Sep 2, 2024Updated last year
- [TMM 2023] Language-Guided Face Animation by Recurrent StyleGAN-based Generator☆11Apr 23, 2023Updated 2 years ago
- This code is submitted to ICCV Workshop 2017: Fake vs. true facial emotion recognition competition☆11Oct 17, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The implementation of FINER-MLLM, which is accepted by MM2024.☆18Oct 8, 2024Updated last year
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 3 months ago
- This is the official code repository for "MEW-UNet: Multi-axis representation learning in frequency domain for medical image segmentation…☆31Nov 8, 2022Updated 3 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆14Dec 30, 2024Updated last year
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 4 months ago
- ☆34May 27, 2023Updated 2 years ago
- ☆17Jan 19, 2026Updated 2 months ago
- MimicDroid: In-Context Learning for Humanoid Robot Manipulation from Human Play Videos☆45Feb 10, 2026Updated last month
- Disentangled 3D face animation generator☆21Feb 21, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Home of the Robot Common Sense Embedding☆10Sep 18, 2021Updated 4 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- [CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation"☆20Apr 18, 2025Updated 11 months ago
- [ICML 2024] RAUCA: A robust and accurate adversarial camouflage generation method☆25Nov 29, 2025Updated 3 months ago
- ☆20Nov 4, 2023Updated 2 years ago
- [NeurIPS 2025] Official repository for “FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models”☆30Dec 9, 2025Updated 3 months ago