FYQ0919 / PTSA-MCTS
A PyTorch implementation of PTSA-MCTS from "Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction".
☆13 · Updated last year
Alternatives and similar repositories for PTSA-MCTS
Users who are interested in PTSA-MCTS compare it to the repositories listed below:
- Official code repository for Prompt-DT. ☆104 · Updated 2 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆60 · Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆108 · Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆95 · Updated last year
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions" ☆87 · Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆54 · Updated 7 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆93 · Updated 3 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult?" ☆20 · Updated 2 months ago
- Synthetic Experience Replay ☆86 · Updated 8 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆101 · Updated 2 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆54 · Updated 4 months ago
- ☆72 · Updated 8 months ago
- Learning diverse options through the Laplacian representation. ☆23 · Updated last year
- Code for "Efficient Offline Policy Optimization with a Learned Model", ICLR 2023 ☆28 · Updated last year
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR 2023): https://arxiv.org/abs/2208.10291 ☆104 · Updated last year
- Transformer-based World Models ☆76 · Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy ☆84 · Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR 2022) ☆67 · Updated 2 years ago
- Clean single-file implementation of offline RL algorithms in JAX ☆132 · Updated last month
- Single-file SAC-N implementation in JAX with Flax and Equinox. 10x faster than PyTorch ☆48 · Updated last year
- DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards ☆21 · Updated 9 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆63 · Updated 8 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆109 · Updated 6 months ago
- Accelerated replay buffers in JAX ☆41 · Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆76 · Updated 2 months ago
- Reinforcement Learning via Supervised Learning ☆71 · Updated 2 years ago
- A collection of RL algorithms written in JAX. ☆95 · Updated 2 years ago
- Official implementation of the paper "Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning" ☆84 · Updated 6 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆62 · Updated last year
- ☆70 · Updated 4 months ago