philipjball/SAC_PyTorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/philipjball/SAC_PyTorch)

philipjball / SAC_PyTorch

🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation

☆39

Alternatives and similar repositories for SAC_PyTorch

Users that are interested in SAC_PyTorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

young-geng / SimpleSAC
View on GitHub
A simple and easy to use implementation of the soft actor-critic algorithm.
☆15Sep 2, 2022Updated 3 years ago
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
dimarkov / pybefit
View on GitHub
Probabilistic inference for models of behaviour
☆13Mar 5, 2026Updated 4 months ago
conorheins / bayesian-mechanics-sdes
View on GitHub
☆14Oct 7, 2022Updated 3 years ago
philipjball / OffCon3
View on GitHub
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25Jun 20, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
christophmark / bayesianfridge
View on GitHub
Sequential Monte Carlo sampler for PyMC2 models.
☆14Apr 4, 2018Updated 8 years ago
philipjball / ReadyPolicyOne
View on GitHub
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18Jul 6, 2023Updated 3 years ago
xingchenwan / bgpbt
View on GitHub
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
☆31Sep 16, 2022Updated 3 years ago
joelouismarino / variational_rl
View on GitHub
Variational Reinforcement Learning
☆17Jul 25, 2024Updated last year
rmrafailov / LOMPO
View on GitHub
Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models
☆31Apr 30, 2021Updated 5 years ago
ctom2 / latent-space-transform
View on GitHub
[AAAI 2021 Workshop] The official repository for the LST-MAP model for few-shot image classification.
☆13Feb 12, 2021Updated 5 years ago
kenjyoung / dreamerv2_JAX
View on GitHub
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18Jan 16, 2023Updated 3 years ago
ikostrikov / dmcgym
View on GitHub
☆23Aug 19, 2022Updated 3 years ago
vikashplus / unitree_sim
View on GitHub
MuJoCo models for Unitree Robots
☆12Nov 24, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
chamorajg / pl-dreamer
View on GitHub
Simplistic Pytorch Implementation of the Dreamer-RL
☆20May 7, 2025Updated last year
denisyarats / drq
View on GitHub
DrQ: Data regularized Q
☆422Jan 13, 2023Updated 3 years ago
tony23545 / DeepKoopman
View on GitHub
Use deep learning to learn Koopman operator and LQR for optimal control
☆18Sep 28, 2020Updated 5 years ago
owencqueen / RL_Final_Project
View on GitHub
"Adaptive Cruise Control for a Hybrid Vehicle with Deep Policy Gradients". Final project for ECE 517/414 Reinforcement Learning.
☆13Dec 8, 2021Updated 4 years ago
young-geng / CQL
View on GitHub
Conservative Q Learning on top of SAC
☆140Oct 15, 2022Updated 3 years ago
Kajiyu / kanerva_machine
View on GitHub
The implementation of "The Kanerva Machine" with Pytorch and Pyro
☆12Jun 14, 2018Updated 8 years ago
ulissigroup / uncertainty_benchmarking
View on GitHub
Various code/notebooks to benchmark different ways we could estimate uncertainty in ML predictions.
☆44Jun 7, 2021Updated 5 years ago
Harshs27 / neural-graphical-models
View on GitHub
Neural Graphical models are neural network based graphical models that offer richer representation, faster inference & sampling
☆30Aug 12, 2025Updated 11 months ago
Jiankai-Sun / Proximal-Policy-Optimization-Pytorch
View on GitHub
Proximal Policy Optimization(PPO) Algorithm and its distributed implementation in Pytorch
☆16Nov 2, 2017Updated 8 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
mbaltieri / GeneralisedFiltering
View on GitHub
General framework for Bayesian inversion of continuous hierarchical models
☆10Sep 20, 2021Updated 4 years ago
mengf1 / DHER
View on GitHub
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆65Nov 8, 2019Updated 6 years ago
hxu296 / torch-evidental-deep-learning
View on GitHub
PyTorch implementation of the original evidental-deep-learning@https://github.com/aamini/evidential-deep-learning/
☆13Sep 20, 2021Updated 4 years ago
PAL-ML / PEARL_v1
View on GitHub
☆30Jan 17, 2022Updated 4 years ago
RobertCsordas / moe
View on GitHub
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆39Jun 11, 2025Updated last year
tedmoskovitz / TOP
View on GitHub
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25Jul 18, 2023Updated 3 years ago
ElisevanderPol / PRAE
View on GitHub
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
☆30Jun 30, 2020Updated 6 years ago
jerrywiston / RL-Mapless-Navigation
View on GitHub
☆21Jun 7, 2020Updated 6 years ago
philipjball / TD3_PyTorch
View on GitHub
♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation
☆10Jun 20, 2021Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
stes / saliency
View on GitHub
Implementing Visual Saliency Models
☆13Jan 10, 2018Updated 8 years ago
vtolani95 / HumANav-Release
View on GitHub
Human Active Navigation Dataset
☆15Sep 18, 2020Updated 5 years ago
fusion-ml / trajectory-information-rl
View on GitHub
Bayesian active RL (BARL) and trajectory information planning (TIP)
☆26Oct 11, 2022Updated 3 years ago
DM2-ND / EDMem
View on GitHub
Code for EMNLP 2022 paper "A Unified Encoder-Decoder Framework with Entity Memory"
☆15Apr 24, 2023Updated 3 years ago
Felhof / DiscreteSAC
View on GitHub
☆40Nov 17, 2021Updated 4 years ago
chandar-lab / Lifelong-Hanabi
View on GitHub
A Continual Multi-agent RL testbed based on Hanabi
☆31Aug 1, 2021Updated 4 years ago
metadriverse / TS2C
View on GitHub
[ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"
☆14Apr 30, 2023Updated 3 years ago