kushagra06/SAC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kushagra06/SAC)

kushagra06 / SAC

Pytorch implementation of Soft Actor-Critic

☆20

Alternatives and similar repositories for SAC

Users that are interested in SAC are comparing it to the libraries listed below

Sorting:

sahandrez / homomorphic_policy_gradient
View on GitHub
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆24Apr 8, 2024Updated last year
hwang-ua / inac_pytorch
View on GitHub
☆19Jun 25, 2023Updated 2 years ago
shakti365 / soft-actor-critic
View on GitHub
TF2 Implementation of the Soft Actor-Critic Algorithm
☆44Dec 8, 2022Updated 3 years ago
PdIPS / ConsensusBasedX.jl
View on GitHub
A Julia package for consensus-based optimisation
☆16Nov 28, 2025Updated 3 months ago
gouxiangchen / ac-ppo
View on GitHub
Actor-Critic and openAI clipped PPO in gym cartpole-v0 and pendulum-v0 environment
☆27Aug 2, 2020Updated 5 years ago
jvmncs / ParamNoise
View on GitHub
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30Mar 14, 2019Updated 6 years ago
david-abel / rl_abstraction
View on GitHub
Code for experimenting with state and action abstractions in reinforcement learning.
☆30Dec 11, 2020Updated 5 years ago
schroederdewitt / mackrl
View on GitHub
Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)
☆33Dec 1, 2019Updated 6 years ago
k1l1 / CoCoFL
View on GitHub
CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization
☆13Aug 3, 2024Updated last year
BHoM / RDF_Prototypes
View on GitHub
Research project of the Cluster of Excellence "Integrative Computational Design and Construction for Architecture" (IntCDC) https://www.i…
☆10Sep 3, 2024Updated last year
enlite-ai / maze_smaac
View on GitHub
Semi-Markov Afterstate Actor-Critic (SMAAC) with Maze
☆11Nov 16, 2021Updated 4 years ago
david-abel / state_abstraction
View on GitHub
Code for abstracting, evaluating, and visualizing Markov Decision Processes.
☆10Jan 12, 2017Updated 9 years ago
Ktakuya332C / deepcube
View on GitHub
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆14Dec 9, 2018Updated 7 years ago
askolik / quantum_agents
View on GitHub
Code for Q-learning with parametrized quantum circuits in OpenAI Gym environments.
☆13Nov 12, 2021Updated 4 years ago
theodoradragan / QuantumAutoencoder
View on GitHub
Implementing https://arxiv.org/abs/1612.02806
☆13Sep 10, 2021Updated 4 years ago
instance01 / fish-rl-alife
View on GitHub
Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.
☆10Nov 30, 2021Updated 4 years ago
google-deepmind / csuite
View on GitHub
☆46Sep 24, 2024Updated last year
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 4 years ago
JuliaPOMDP / POMCP.jl
View on GitHub
Julia Implementation of the POMCP algorithm for solving POMDPs
☆12Aug 6, 2021Updated 4 years ago
Alekat13 / Deep-Reinforcement-Ant-Colony-Optimization-
View on GitHub
Swarm learning algorithm
☆11Jun 2, 2021Updated 4 years ago
changh95 / Study-Resources-Review
View on GitHub
Personal reviews of popular MOOCs (Massive Open Online Courses) and study plans
☆11Mar 27, 2019Updated 6 years ago
SUIBE-Blockchain / Super-NFT
View on GitHub
基于 FISCO BCOS / Vechain 的超级 NFT 平台。
☆11May 11, 2021Updated 4 years ago
KomeijiForce / Active_Passive_Constraint_Koishiday_2024
View on GitHub
Koishi's Day 2024 Paper (NeurIPS 2024): An advanced persona-driven role-playing system with global faithfulness quantification and optimi…
☆11Oct 19, 2025Updated 4 months ago
mjamroz / PlantRecognition
View on GitHub
Example of android app written in Qt/Qml which uses MXNet for plant image recognition.
☆10Nov 4, 2017Updated 8 years ago
Miyembe / frl_swarm
View on GitHub
Federated Deep Reinforcement Learning for Swarm Robotic Systems
☆10Jun 2, 2022Updated 3 years ago
karminski / system-prompt-research
View on GitHub
一个分析大型语言模型系统提示词的研究项目
☆73Oct 13, 2025Updated 4 months ago
Gamma-Software / CustomerCareAI
View on GitHub
This project aims at giving the best customer service ever using the power of LLM models like GPT.
☆10Jun 29, 2023Updated 2 years ago
thethaibinh / agile_flight
View on GitHub
Simulation system for path planning evaluation
☆14Dec 13, 2025Updated 2 months ago
wqynew / Enhanced-NeoNav
View on GitHub
Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning
☆12Dec 20, 2020Updated 5 years ago
toshikwa / slac.pytorch
View on GitHub
PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).
☆94Jul 25, 2024Updated last year
roosephu / slbo
View on GitHub
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆55Jul 26, 2019Updated 6 years ago
cfsantos / MaxDropout-torch
View on GitHub
3.76%, 18.81% on CIFAR10, CIFAR100 https://arxiv.org/
☆10Jul 28, 2020Updated 5 years ago
devidduma / mujoco-benchmark
View on GitHub
MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.
☆15Jan 12, 2025Updated last year
VITA-Group / VGAI
View on GitHub
[IEEE TSIPN' 2022] "Scalable Perception-Action-Communication Loops with Convolutional and Graph Neural Networks", by Ting-Kuei Hu, Fernan…
☆15Feb 4, 2022Updated 4 years ago
roosephu / boots
View on GitHub
☆11Oct 14, 2019Updated 6 years ago
ZeroChiLi / TanksPluggableAI
View on GitHub
Stuy Pluggable AI
☆11Oct 3, 2017Updated 8 years ago
SioKCronin / Hindsight-Experience-Replay
View on GitHub
PPO with Hindsight Experience Replay (HER)
☆11May 8, 2018Updated 7 years ago
MHenderson1988 / line-of-sight-tool-python
View on GitHub
A Python application for processing multiple line of sight queries, via the 'Google Elevation' API.
☆10May 23, 2023Updated 2 years ago
huaxiuyao / HSML_Dynamic
View on GitHub
HSML Dynamic version for ICML 2019
☆12Jul 11, 2019Updated 6 years ago