snumprlab / capeam

Official Implementation of CAPEAM (ICCV'23)

☆14

Alternatives and similar repositories for capeam

Users who are interested in capeam are comparing it to the repositories listed below.

hitachi-rd-cv / prompter-alfred
Prompter for Embodied Instruction Following
☆18 Updated 2 years ago
Dantong88 / LLARVA
☆60 Updated 11 months ago
snumprlab / realfred
Official Implementation of ReALFRED (ECCV'24)
☆43 Updated last year
thunlp / EmbodiedEval
Evaluate Multimodal LLMs as Embodied Agents
☆54 Updated 9 months ago
EmbodiedGPT / EgoCOT_Dataset
☆54 Updated last year
TencentARC / Moto
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆148 Updated last month
rainbow979 / robodreamer
☆87 Updated last year
aopolin-lv / RoboMP2
[ICML 2024] RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models
☆12 Updated 5 months ago
google-deepmind / robovqa
☆33 Updated last year
declare-lab / Emma-X
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
☆76 Updated 6 months ago
Gabesarch / HELPER
☆32 Updated last year
AlbertTan404 / pytorch-open-x-embodiment
Data pre-processing and training code on Open-X-Embodiment with pytorch
☆11 Updated 10 months ago
stevenyangyj / Emma-Alfworld
Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld
☆60 Updated last year
EmbodiedBench / EmbodiedBench
[ICML 2025 Oral] Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
☆218 Updated last month
ChenYi99 / EgoPlan
[IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning
☆74 Updated 11 months ago
UMass-Embodied-AGI / COMBO
Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"
☆44 Updated 8 months ago
Max-Fu / otter
[ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
☆110 Updated 7 months ago
aiming-lab / GRAPE
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
☆151 Updated 7 months ago
snumprlab / cl-alfred
Official Implementation of CL-ALFRED (ICLR'24)
☆28 Updated last year
sled-group / RACER
[ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning
☆38 Updated last year
vlc-robot / hiveformer
☆33 Updated last year
UMass-Embodied-AGI / MultiPLY
Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
☆134 Updated last year
2toinf / DecisionNCE
[ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"
☆82 Updated 6 months ago
allenai / embodied-clip
Official codebase for EmbCLIP
☆132 Updated 2 years ago
sled-group / navchat
Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation". Paper: https://arxiv.org/abs/2310.07968 …
☆31 Updated last year
flow-diffusion / AVDC
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.
☆236 Updated last year
BeingBeyond / Being-H0
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos
☆180 Updated 2 months ago
eric-ai-lab / VLMbench
NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"
☆96 Updated 6 months ago
kodenii / Responsible-Robotic-Manipulation
Responsible Robotic Manipulation
☆14 Updated 3 months ago
valtsblukis / hlsm
☆45 Updated 3 years ago