Matt00n / PolicyGradientsJax
On-Policy Policy Gradient Algorithms in JAX
☆22 Updated 9 months ago
Related projects
Alternatives and complementary repositories for PolicyGradientsJax
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral) ☆55 Updated 10 months ago
- Synthetic Experience Replay ☆74 Updated 5 months ago
- Code for the NeurIPS 2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning. ☆26 Updated 5 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆76 Updated last year
- Source files to replicate experiments in my ICLR 2022 paper. ☆62 Updated 4 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR 2022) ☆66 Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆116 Updated last year
- ☆55 Updated last month
- Foundation Policies with Hilbert Representations (ICML 2024) ☆72 Updated 7 months ago
- Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO). ☆73 Updated 11 months ago
- Code for TransDreamer: Reinforcement Learning with Transformer World Models ☆22 Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy ☆80 Updated last year
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning. ☆14 Updated last week
- Skeleton for scalable and flexible JAX RL implementations ☆63 Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆52 Updated 7 months ago
- OGBench: Benchmarking Offline Goal-Conditioned RL ☆79 Updated 3 weeks ago
- ☆29 Updated 8 months ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data… ☆36 Updated 8 months ago
- Repo for Implicit Diffusion Q-Learning ☆93 Updated 11 months ago
- Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL, published in ICLR 2023 ☆29 Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆57 Updated 5 months ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆32 Updated last year
- Code base for the paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆26 Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆59 Updated last year
- Official repository for the paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022) ☆35 Updated last year
- Prioritized Experience Replay implementation with proportional prioritization ☆69 Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆52 Updated last month
- An unofficial implementation of Online Decision Transformer ☆37 Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆95 Updated 5 months ago
- Benchmarked implementations of Offline RL Algorithms. ☆65 Updated 6 months ago