π§Ά Minimal PyTorch Soft Actor Critic (SAC) implementation
β38Feb 19, 2022Updated 4 years ago
Alternatives and similar repositories for SAC_PyTorch
Users that are interested in SAC_PyTorch are comparing it to the libraries listed below
Sorting:
- Sequential Monte Carlo sampler for PyMC2 models.β13Apr 4, 2018Updated 7 years ago
- A simple and easy to use implementation of the soft actor-critic algorithm.β15Sep 2, 2022Updated 3 years ago
- β14Oct 7, 2022Updated 3 years ago
- Variational Reinforcement Learningβ17Jul 25, 2024Updated last year
- Docker containers of baseline agents for the Crafter environmentβ30Dec 14, 2021Updated 4 years ago
- π Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)β18Jul 6, 2023Updated 2 years ago
- π΄ OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)β25Jun 20, 2021Updated 4 years ago
- [AAAI 2021 Workshop] The official repository for the LST-MAP model for few-shot image classification.β13Feb 12, 2021Updated 5 years ago
- Probabilistic inference for models of behaviourβ10Oct 13, 2025Updated 4 months ago
- General framework for Bayesian inversion of continuous hierarchical modelsβ10Sep 20, 2021Updated 4 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.β18Jan 16, 2023Updated 3 years ago
- Bayesian model reduction for probabilistic machine learningβ11Jul 3, 2025Updated 7 months ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actionsβ30Jun 30, 2020Updated 5 years ago
- MuJoCo models for Unitree Robotsβ12Nov 24, 2021Updated 4 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyroβ12Jun 14, 2018Updated 7 years ago
- PyTorch implementation of the original evidental-deep-learning@https://github.com/aamini/evidential-deep-learning/β13Sep 20, 2021Updated 4 years ago
- Implementing Visual Saliency Modelsβ13Jan 10, 2018Updated 8 years ago
- DrQ: Data regularized Qβ420Jan 13, 2023Updated 3 years ago
- Use deep learning to learn Koopman operator and LQR for optimal controlβ17Sep 28, 2020Updated 5 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Modelsβ31Apr 30, 2021Updated 4 years ago
- My Body Is A Cageβ41Apr 13, 2021Updated 4 years ago
- Flax Implementation of DreamerV3 on Crafterβ18Nov 29, 2025Updated 3 months ago
- A minimal implementation of Go-Explore without domain knowledgeβ15Apr 26, 2021Updated 4 years ago
- Code for EMNLP 2022 paper "A Unified Encoder-Decoder Framework with Entity Memory"β15Apr 24, 2023Updated 2 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Leβ¦β18Apr 13, 2021Updated 4 years ago
- Implementation of importance sampling, direct, and hybrid methods for off-policy evaluation.β16Mar 28, 2020Updated 5 years ago
- Minimal implementation of Contrastive Predictive Coding for audio.β17Nov 17, 2019Updated 6 years ago
- Active inference implementation of dynamic multi-armed banditsβ20Jun 25, 2025Updated 8 months ago
- Various code/notebooks to benchmark different ways we could estimate uncertainty in ML predictions.β42Jun 7, 2021Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QTβ¦β16Nov 18, 2020Updated 5 years ago
- β25Jan 2, 2019Updated 7 years ago
- Simplistic Pytorch Implementation of the Dreamer-RLβ20May 7, 2025Updated 9 months ago
- Conservative Q Learning on top of SACβ137Oct 15, 2022Updated 3 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtreeβ26May 2, 2025Updated 10 months ago
- β23Aug 19, 2022Updated 3 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)β26Oct 11, 2022Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimationβ25Jul 18, 2023Updated 2 years ago
- Conformal Histogram Regression: efficient conformity scores for non-parametric regression problemsβ24Mar 26, 2022Updated 3 years ago
- JAX implementations of core Deep RL algorithmsβ83May 2, 2022Updated 3 years ago