yining043/SAC-discrete

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yining043/SAC-discrete)

yining043 / SAC-discrete

Modified versions of the Soft Actor-Critic algorithm for Atari games from https://github.com/ac-93/soft-actor-critic.

☆20

Alternatives and similar repositories for SAC-discrete

Users that are interested in SAC-discrete are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ac-93 / soft-actor-critic
View on GitHub
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆98Jun 22, 2020Updated 6 years ago
toshikwa / sac-discrete.pytorch
View on GitHub
PyTorch implementation of SAC-Discrete.
☆316Jul 25, 2024Updated 2 years ago
LihaoR / Entropy-Regularized-RL
View on GitHub
soft q learning and soft actor critic
☆16Dec 23, 2018Updated 7 years ago
marooncn / RLnotes
View on GitHub
☆34May 25, 2020Updated 6 years ago
flint-xf-fan / MLDA-Workshop
View on GitHub
ML/DL training workshops for EEE undergrads
☆13Jan 16, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CursedSeraphim / icmppo
View on GitHub
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆18Apr 15, 2022Updated 4 years ago
tianbingsz / SVRG
View on GitHub
Stochastic Variance Reduction Policy Gradient Estimation
☆11Nov 6, 2018Updated 7 years ago
chvmp / pychamp
View on GitHub
☆13May 30, 2021Updated 5 years ago
WPI-MMR / gym_solo
View on GitHub
A custom open ai gym environment for solo experimentation.
☆12Apr 14, 2021Updated 5 years ago
rasmushaugaard / spyropose
View on GitHub
SpyroPose (ICCVW 2023)
☆10Jun 30, 2025Updated last year
dtkon / PDP-NCS
View on GitHub
Detian Kong, Yining Ma, Zhiguang Cao, Tianshu Yu and Jianhua Xiao, "Efficient Neural Collaborative Search for Pickup and Delivery Problem…
☆15Feb 11, 2025Updated last year
GilbertPan97 / RobCalib_AXYB
View on GitHub
RobCalib_AXYB is a comprehensive toolbox designed to address the AX=YB calibration problem between a robot (hand) and a camera (eye), ser…
☆10Jun 7, 2024Updated 2 years ago
jieyibi / CaR-constraint
View on GitHub
[ICLR 2026] Towards Efficient Constraint Handling in Neural Solvers for Routing Problems
☆15Feb 20, 2026Updated 5 months ago
AadityaRavindran / gym-cartpolemod
View on GitHub
Modified CartPole-v0 OpenAI Gym environment with various noisy cases and Reinforcement Learning based controller
☆10Dec 5, 2017Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
gpujs / matrix-log.js
View on GitHub
A matrix log utility. Useful for converting CPU code to GPU code.
☆18Updated this week
jingwenli0312 / Feature-Embedding-Refiner
View on GitHub
☆18Jun 30, 2023Updated 3 years ago
MetaEvo / Symbol
View on GitHub
Python implementation of SYMBOL
☆18Feb 29, 2024Updated 2 years ago
ServiceNow / MiniTouch
View on GitHub
MiniTouch is a ServiceNow Research project that was started at Element AI.
☆14Jul 5, 2023Updated 3 years ago
xsaxy / Sabaki-zh-CN
View on GitHub
这是Sabaki的中文版。下载最新版本
☆19Jun 15, 2019Updated 7 years ago
hlfshell / wpi-capstone
View on GitHub
WPI Capstone Project for Group 2, 2023
☆18Feb 14, 2024Updated 2 years ago
WilsonWangTHU / POPLIN
View on GitHub
☆99Mar 24, 2023Updated 3 years ago
haewngX / PathPlanning_demo
View on GitHub
cpp implementation of PathPlanning algorithm in PythonRobotics
☆16Jul 9, 2019Updated 7 years ago
kngwyu / Rainy
View on GitHub
Deep RL agents with PyTorch
☆35Sep 25, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mattclifford1 / IQM-Vis
View on GitHub
Image Quality Metric Visualision. An extendable user interface for the assessment of transformations on image metrics.
☆25Mar 6, 2026Updated 4 months ago
lns / memoire
View on GitHub
☆18Apr 17, 2019Updated 7 years ago
deep-reinforcement-learning-book / Chapter15-AlphaZero
View on GitHub
Chapter 15 AlphaZero in book Deep Reinforcement Learning: code example of AlphaZero solving Gomoku game.
☆36Feb 18, 2020Updated 6 years ago
SUBER-Team / SUBER
View on GitHub
This repository accompanies our research paper titled "An LLM-based Recommender System Environment".
☆17Jul 15, 2024Updated 2 years ago
water-mirror / DPR
View on GitHub
Dynamic Partial Removal: a Neural Network Heuristic for Large Neighborhood Search on Combinatorial Optimization Problems, by applying dee…
☆20Jun 17, 2020Updated 6 years ago
hongyanz / TRADES-smoothing
View on GitHub
[JMLR] TRADES + random smoothing for certifiable robustness
☆14Sep 13, 2020Updated 5 years ago
SITE5039 / AdaMixUp
View on GitHub
☆14May 7, 2019Updated 7 years ago
marcalexa / superfibonacci
View on GitHub
☆19Mar 8, 2023Updated 3 years ago
jmichaux / intrinsic-motivation
View on GitHub
Using multiple sensor modalities to improve exploration for robotic manipulation tasks with sparse rewards
☆10Sep 17, 2019Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
cmmp / pyproclus
View on GitHub
A python implementation of PROCLUS: PROjected CLUStering algorithm.
☆10Jan 12, 2015Updated 11 years ago
virgile-hogman / kth-rgbd
View on GitHub
Visual SLAM from RGB-D data using Microsoft Kinect
☆10May 13, 2016Updated 10 years ago
Lynn1 / llama3-stream
View on GitHub
A simple and efficient llama3 local service deployment solution that supports real-time streaming response and is optimized for common Ch…
☆13Jul 31, 2024Updated last year
davidstutz / aml-improved-shape-completion
View on GitHub
ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.
☆11Nov 30, 2018Updated 7 years ago
anonymized-research / progen2
View on GitHub
☆11Sep 27, 2022Updated 3 years ago
HetTransformer / HetTransformer-model
View on GitHub
☆10Jun 21, 2021Updated 5 years ago
camillol / MacTorcs
View on GitHub
Mac port of Torcs, The Open Racing Car Simulator
☆11Jun 16, 2010Updated 16 years ago