☆110Feb 11, 2026Updated 4 months ago
Alternatives and similar repositories for alexa-arena
Users that are interested in alexa-arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for CVPR22 paper One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones☆13Jul 27, 2022Updated 3 years ago
- Repository for DialFRED.☆45Sep 14, 2023Updated 2 years ago
- TEACh is a dataset of human-human interactive dialogues to complete tasks in a simulated household environment.☆145May 6, 2024Updated 2 years ago
- ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks☆517Feb 5, 2026Updated 4 months ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆26May 22, 2024Updated 2 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…☆93Jul 11, 2023Updated 2 years ago
- a benchmark to evaluate the situated inductive reasoning☆16Jan 7, 2025Updated last year
- ☆48Jun 24, 2022Updated 3 years ago
- Official repository of ICLR 2022 paper FILM: Following Instructions in Language with Modular Methods☆128Apr 9, 2023Updated 3 years ago
- ☆13Nov 5, 2024Updated last year
- ☆33Sep 22, 2024Updated last year
- A World Model-Based Framework for Vision-Language Robot Manipulation☆32Oct 22, 2025Updated 7 months ago
- [ICCV 2023] ARNOLD: Language-Grounded Robot Manipulation with Continuous Object States in Realistic 3D Scenes☆186Mar 16, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- ☆26Oct 28, 2022Updated 3 years ago
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆44Apr 17, 2023Updated 3 years ago
- A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future…☆597May 2, 2024Updated 2 years ago
- An open-source framework for modeling real-time conversations in spoken dialogue systems.☆27Aug 12, 2022Updated 3 years ago
- ☆21Oct 16, 2025Updated 8 months ago
- Visual Navigation with Natural Multimodal Assistance (EMNLP 2019)☆29Jun 30, 2020Updated 5 years ago
- Code for NeurIPS 2021 paper "Curriculum Learning for Vision-and-Language Navigation"☆15Dec 13, 2022Updated 3 years ago
- Official Implementation of ReALFRED (ECCV'24)☆46Oct 11, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"☆18Aug 7, 2025Updated 10 months ago
- ☆60Apr 21, 2022Updated 4 years ago
- Image Reconstruction and Restoration of Cats and Dogs Dataset using PyTorch's Torch and Torchvision Libraries☆13Oct 2, 2020Updated 5 years ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆45Apr 19, 2024Updated 2 years ago
- Multi-Target Embodied Question Answering☆26Jul 17, 2020Updated 5 years ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Jan 27, 2025Updated last year
- Training code of waypoint predictor in Discrete-to-Continuous VLN.☆32Mar 25, 2024Updated 2 years ago
- Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral)☆20Oct 21, 2023Updated 2 years ago
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models☆226Mar 26, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks☆21May 18, 2023Updated 3 years ago
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆16Jul 2, 2024Updated last year
- Robotics Learning Note☆11Jun 22, 2018Updated 7 years ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- ☆24Oct 8, 2023Updated 2 years ago
- Preference Learning for LLaVA☆59Nov 9, 2024Updated last year
- Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs☆112Sep 30, 2025Updated 8 months ago