yunyikristy / DualMind
☆17Updated last year
Alternatives and similar repositories for DualMind
Users that are interested in DualMind are comparing it to the libraries listed below
Sorting:
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆80Updated 7 months ago
- ☆45Updated last year
- [MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling☆17Updated 9 months ago
- Code for Reinforcement Learning from Vision Language Foundation Model Feedback☆104Updated 11 months ago
- Official code for "QueST: Self-Supervised Skill Abstractions for Continuous Control" [NeurIPS 2024]☆77Updated 5 months ago
- ☆38Updated 8 months ago
- ☆40Updated last year
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld☆57Updated 7 months ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆91Updated last week
- ☆41Updated last year
- Codebase for HiP☆89Updated last year
- Instruction Following Agents with Multimodal Transforemrs☆52Updated 2 years ago
- ☆55Updated 10 months ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆74Updated 11 months ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆78Updated last month
- ☆69Updated 7 months ago
- [IJCAI'24] An index of algorithms, approaches, and systems on cross-domain policy transfer for embodied agents☆50Updated 3 months ago
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆129Updated last year
- Code for the paper Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance, accepted to CoRL 2023 as an…☆31Updated 9 months ago
- ☆44Updated last year
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆26Updated 2 months ago
- ☆17Updated 3 months ago
- Codebase for PRISE: Learning Temporal Action Abstractions as a Sequence Compression Problem☆22Updated 10 months ago
- Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"☆156Updated last year
- ☆41Updated 7 months ago
- ☆46Updated 5 months ago
- Official code for the long-horizon language-conditioned robotic manipulation benchmark LoHoRavens.☆15Updated 7 months ago
- PyTorch implementation of the Hiveformer research paper☆48Updated last year
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆76Updated last month
- Masked World Models for Visual Control☆122Updated last year