๐งถ Minimal PyTorch Soft Actor Critic (SAC) implementation
โ39Feb 19, 2022Updated 4 years ago
Alternatives and similar repositories for SAC_PyTorch
Users that are interested in SAC_PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple and easy to use implementation of the soft actor-critic algorithm.โ15Sep 2, 2022Updated 3 years ago
- Docker containers of baseline agents for the Crafter environmentโ30Dec 14, 2021Updated 4 years ago
- โ14Oct 7, 2022Updated 3 years ago
- ๐ด OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)โ25Jun 20, 2021Updated 4 years ago
- Sequential Monte Carlo sampler for PyMC2 models.โ13Apr 4, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient โข AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.โ18Jan 16, 2023Updated 3 years ago
- ๐ Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)โ18Jul 6, 2023Updated 2 years ago
- Probabilistic inference for models of behaviourโ13Mar 5, 2026Updated last month
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)โ30Sep 16, 2022Updated 3 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Modelsโ30Apr 30, 2021Updated 5 years ago
- [AAAI 2021 Workshop] The official repository for the LST-MAP model for few-shot image classification.โ13Feb 12, 2021Updated 5 years ago
- โ23Aug 19, 2022Updated 3 years ago
- Use deep learning to learn Koopman operator and LQR for optimal controlโ18Sep 28, 2020Updated 5 years ago
- MuJoCo models for Unitree Robotsโ12Nov 24, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DrQ: Data regularized Qโ420Jan 13, 2023Updated 3 years ago
- Simplistic Pytorch Implementation of the Dreamer-RLโ20May 7, 2025Updated 11 months ago
- PyTorch implementation of Soft Actor-Critic(SAC).โ106Jun 9, 2020Updated 5 years ago
- "Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE 517/414 Reinforcement Learning.โ13Dec 8, 2021Updated 4 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observationsโ113Apr 16, 2026Updated 2 weeks ago
- Code to reproduce Neural Game Engine experiments and pre-trained modelsโ41Jun 22, 2022Updated 3 years ago
- A custom version of the ros/tf package, with small API changes. Used by some the our code, until we migrate to TF2.โ14Apr 17, 2014Updated 12 years ago
- Various code/notebooks to benchmark different ways we could estimate uncertainty in ML predictions.โ43Jun 7, 2021Updated 4 years ago
- General framework for Bayesian inversion of continuous hierarchical modelsโ10Sep 20, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean โข AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Proximal Policy Optimization(PPO) Algorithm and its distributed implementation in Pytorchโ16Nov 2, 2017Updated 8 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)โ65Nov 8, 2019Updated 6 years ago
- PyTorch implementation of the original evidental-deep-learning@https://github.com/aamini/evidential-deep-learning/โ13Sep 20, 2021Updated 4 years ago
- โ30Jan 17, 2022Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimationโ25Jul 18, 2023Updated 2 years ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"โ39Jun 11, 2025Updated 10 months ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actionsโ30Jun 30, 2020Updated 5 years ago
- โ21Jun 7, 2020Updated 5 years ago
- โ Minimal PyTorch Twin Delayed DDPG (TD3) implementationโ10Jun 20, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI โข AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Human Active Navigation Datasetโ14Sep 18, 2020Updated 5 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)โ26Oct 11, 2022Updated 3 years ago
- Code for EMNLP 2022 paper "A Unified Encoder-Decoder Framework with Entity Memory"โ15Apr 24, 2023Updated 3 years ago
- A Continual Multi-agent RL testbed based on Hanabiโ32Aug 1, 2021Updated 4 years ago
- โ25Jan 2, 2019Updated 7 years ago
- โ40Nov 17, 2021Updated 4 years ago
- My Body Is A Cageโ41Apr 13, 2021Updated 5 years ago