minerllabs / basalt-benchmarkLinks
BASALT Benchmark datasets, evaluation code and agent training example.
☆21Updated last year
Alternatives and similar repositories for basalt-benchmark
Users that are interested in basalt-benchmark are comparing it to the libraries listed below
Sorting:
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆113Updated last year
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94Updated 2 years ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆77Updated last year
- Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction"☆46Updated 2 years ago
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆92Updated 9 months ago
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)☆65Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated last year
- Code for Contrastive Preference Learning (CPL)☆176Updated 11 months ago
- ☆15Updated last year
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft☆195Updated last year
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆35Updated 4 months ago
- Verlog: A Multi-turn RL framework for LLM agents☆64Updated 2 weeks ago
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆135Updated last year
- An RL-Friendly Vision-Language Model for Minecraft☆38Updated last year
- Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 20…☆35Updated 8 months ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆159Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- off-policy RL on long sequences☆147Updated 3 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆143Updated last year
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆70Updated last year
- Object Centric Atari games☆95Updated 3 weeks ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆132Updated 2 years ago
- ☆105Updated last year
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆43Updated last year
- Thisi is the official code base for paper "Think Before You Act: Decision Transformers with Internal Working Memory"☆22Updated last year
- ☆65Updated last year
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆56Updated 11 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆207Updated 3 months ago
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆130Updated 3 years ago