minerllabs / basalt-benchmarkLinks
BASALT Benchmark datasets, evaluation code and agent training example.
☆21Updated 2 years ago
Alternatives and similar repositories for basalt-benchmark
Users that are interested in basalt-benchmark are comparing it to the libraries listed below
Sorting:
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆114Updated last year
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆97Updated 10 months ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94Updated 2 years ago
- Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction"☆46Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated last year
- Code for Contrastive Preference Learning (CPL)☆177Updated last year
- ☆15Updated last year
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆144Updated last year
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆77Updated last year
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos (ICLR'24, Spotlight)☆65Updated 2 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Updated 2 years ago
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft☆200Updated last year
- Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 20…☆35Updated 9 months ago
- Thisi is the official code base for paper "Think Before You Act: Decision Transformers with Internal Working Memory"☆22Updated last year
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆56Updated last year
- off-policy RL on long sequences☆154Updated 4 months ago
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆130Updated 3 years ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆29Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- Verlog: A Multi-turn RL framework for LLM agents☆67Updated last month
- An RL-Friendly Vision-Language Model for Minecraft☆38Updated last year
- Using advances in generative modeling to learn reward functions from unlabeled videos.☆136Updated last year
- ☆79Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆133Updated 2 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆61Updated last year
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆47Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆118Updated last year
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆35Updated 5 months ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆161Updated this week