shakti365/soft-actor-critic

TF2 Implementation of the Soft Actor-Critic Algorithm

☆43

Alternatives and similar repositories for soft-actor-critic

Users who are interested in soft-actor-critic are comparing it to the libraries listed below.

LihaoR / Entropy-Regularized-RL
soft q learning and soft actor critic
☆16 · Dec 23, 2018 · Updated 7 years ago
IgnacioCarlucho / DDPG_MountainCar
The continuous mountain car problem solved with DDPG
☆13 · Apr 19, 2020 · Updated 6 years ago
mengf1 / CHER
Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)
☆67 · Feb 14, 2020 · Updated 6 years ago
kushagra06 / SAC
Pytorch implementation of Soft Actor-Critic
☆20 · Apr 13, 2020 · Updated 6 years ago
google-deepmind / csuite
☆47 · Sep 24, 2024 · Updated last year
rail-berkeley / softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…
☆1,434 · Nov 29, 2023 · Updated 2 years ago
YYCAAA / V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆48 · Nov 10, 2020 · Updated 5 years ago
uidilr / deepirl_chainer
Implementation of GAIL and AIRL using chainerrl
☆16 · Jun 21, 2022 · Updated 4 years ago
mitmedialab / kukaslxctrl
A small library intended for controlling KUKA robots using KRC4 over KUKA RSI (Robot Sensor Interface) from Simulink.
☆11 · Mar 30, 2017 · Updated 9 years ago
nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
☆107 · Jun 7, 2019 · Updated 7 years ago
fangihsiao / GAIL-Tensorflow
GAIL implementation using Tensorflow
☆14 · Sep 17, 2019 · Updated 6 years ago
nric / ProximalPolicyOptimizationKeras
This is a deterministic TensorFlow 2.0 (Keras) implementation of OpenAI's Proximal Policy Optimization (PPO) actor-critic algorithm.
☆12 · Sep 3, 2020 · Updated 5 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆297 · Feb 24, 2021 · Updated 5 years ago
jimilong / socket_c
TCP socket packet packing, unpacking, and sticky-packet handling (header + body)
☆10 · Sep 6, 2016 · Updated 9 years ago
lineCode / rl_atari_pytorch
Reinforcement learning: learning to play Atari using DDPG and LSTM.
☆20 · Aug 5, 2017 · Updated 8 years ago
zero-cola / Posco-AI-Challenge-2018
Predicting the onset time of swell-like waves on the East Coast of Korea (1st place winner)
☆12 · Sep 30, 2018 · Updated 7 years ago
minseop4898 / RL-Course-by-David-Silver
☆13 · Aug 29, 2019 · Updated 6 years ago
whitegreen / JavaKUKA
Javakuka is an open-source project for creating Kuka Robot Language (KRL) codes in Java.
☆10 · May 15, 2024 · Updated 2 years ago
carlospurves / psxle
A Python interface to the Sony PlayStation console.
☆25 · Dec 14, 2019 · Updated 6 years ago
li-xl / rotationconverter
A converter for Euler Angle, Axis Angle, Quaternion, Rotation Matrix.
☆16 · Jun 9, 2021 · Updated 5 years ago
tgisaturday / Seq2CNN
Word Embedding Annealing Using Sequence-to-sequence Model
☆16 · Dec 2, 2020 · Updated 5 years ago
reinforcement-learning-kr / lets-do-irl
Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)
☆781 · Dec 22, 2023 · Updated 2 years ago
erwincoumans / ARS
An implementation of the Augmented Random Search algorithm
☆14 · Jan 29, 2022 · Updated 4 years ago
createamind / Planet
☆22 · Dec 8, 2022 · Updated 3 years ago
notplaid / prices
Big-data competition on predicting monthly housing rent
☆17 · Nov 28, 2018 · Updated 7 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆99 · Jun 22, 2020 · Updated 6 years ago
wangyy161 / DDPG_CNN_Pendulum_practice
practice
☆11 · Jun 30, 2020 · Updated 6 years ago
mnourgwad / zuka
Application of the Industrial Robotic Arm KR6 R900 sixx in 3D Milling that includes developing post-processing tools to convert any conve…
☆13 · Jul 7, 2026 · Updated 2 weeks ago
guirmoreira / kuka-python
Python library used to communicate and control Kuka manipulator (tested on KR16 model). Under continuous development.
☆12 · Aug 9, 2018 · Updated 7 years ago
vaishak2future / sac
Implementation of Soft Actor Critic
☆37 · Aug 27, 2021 · Updated 4 years ago
gditzler / ConceptDriftData
Generate synthetic data sets containing concept drift, or load one of two real-world concept drift benchmark data sets.
☆12 · May 10, 2013 · Updated 13 years ago
adik993 / reinforcement-learning-sutton
☆16 · Mar 4, 2020 · Updated 6 years ago
a791702141 / SSG
This project is the official implementation of "Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation" in PyTorch, wh…
☆12 · Nov 4, 2022 · Updated 3 years ago
act3-ace / aerospaceRL
☆14 · Dec 14, 2021 · Updated 4 years ago
SamuelSchmidgall / EvolutionarySelfReplication
Produce intelligence by means of natural selection without objective/reward optimization
☆16 · Sep 29, 2021 · Updated 4 years ago
LucasAlegre / mbcd
Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"
☆11 · Aug 7, 2023 · Updated 2 years ago
ubc-tea / Local-Superior-Soups
☆15 · Dec 10, 2024 · Updated last year
lucienerdin / DistributedStateEstimation_PowerSystems
Code for a centralised State Estimation and a distributed State Estimation in Power Transmission Networks
☆11 · Jul 11, 2021 · Updated 5 years ago
EXPSIN / Quadratic-Programming-for-Continuous-Control-of-Safety-Critical-Multi-Agent-Systems-Under-Uncertaint
Quadratic Programming for Continuous Control of Safety-Critical Multi-Agent Systems Under Uncertainty
☆14 · Sep 7, 2024 · Updated last year