young-geng/SimpleSAC

young-geng / SimpleSAC

A simple, easy-to-use implementation of the soft actor-critic algorithm.

☆15

Alternatives and similar repositories for SimpleSAC

Users who are interested in SimpleSAC are comparing it to the libraries listed below.

Asap7772 / PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…
☆32 · Oct 26, 2022 · Updated 3 years ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆17 · Jul 25, 2024 · Updated last year
philipjball / SAC_PyTorch
🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation
☆39 · Feb 19, 2022 · Updated 4 years ago
vikashplus / unitree_sim
MuJoCo models for Unitree Robots
☆12 · Nov 24, 2021 · Updated 4 years ago
maxreciprocate / offline
Offline RL experiments
☆15 · Oct 1, 2022 · Updated 3 years ago
brentyi / transformer-exercises-jax
☆18 · Apr 17, 2026 · Updated 3 months ago
justinjfu / diagnosing_qlearning
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
☆17 · May 14, 2019 · Updated 7 years ago
yobibyte / amorpheus
My Body Is A Cage
☆41 · Apr 13, 2021 · Updated 5 years ago
ars22 / e3
☆20 · Sep 16, 2025 · Updated 10 months ago
young-geng / CQL
Conservative Q Learning on top of SAC
☆140 · Oct 15, 2022 · Updated 3 years ago
aviralkumar2907 / MMCE
☆19 · Jun 5, 2018 · Updated 8 years ago
clvrai / create
CREATE Environment for long-horizon physics-puzzle tasks with diverse tools
☆18 · Nov 22, 2022 · Updated 3 years ago
rail-berkeley / design-baselines
☆24 · Feb 16, 2022 · Updated 4 years ago
joelouismarino / amortized-variational-filtering
PyTorch implementation of AVF
☆45 · Sep 2, 2020 · Updated 5 years ago
liumengyu0817 / DROO_version3
Master's thesis code: deep reinforcement learning
☆10 · Apr 4, 2020 · Updated 6 years ago
GGchen1997 / BDI
This repository is the official implementation of Bidirectional Learning for Offline Infinite-width Model-based Optimization (NeurIPS 202…
☆14 · Jan 19, 2023 · Updated 3 years ago
julienroyd / coordination-marl
Code to reproduce experiments from:
☆10 · Dec 11, 2020 · Updated 5 years ago
kevinzakka / dm_env_wrappers
Standalone library of frequently-used wrappers for dm_env environments.
☆19 · Jul 9, 2024 · Updated 2 years ago
ikostrikov / jaxrl2
☆58 · Jan 20, 2023 · Updated 3 years ago
evgenii-nikishin / omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
☆43 · Jun 14, 2021 · Updated 5 years ago
ikostrikov / dmcgym
☆23 · Aug 19, 2022 · Updated 3 years ago
Asap7772 / understanding-rlhf
Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…
☆32 · Apr 20, 2024 · Updated 2 years ago
sihyun-yu / RoMA
[NeurIPS'21] RoMA: Robust Model Adaptation for Offline Model-based Optimization
☆15 · Oct 28, 2021 · Updated 4 years ago
coolmoon327 / Online-Scheduling-for-Energy-Minimization-in-Wireless-Powered-Mobile-Edge-Computing
Related paper: Online Scheduling for Energy Minimization in Wireless Powered Mobile Edge Computing
☆10 · Jan 5, 2023 · Updated 3 years ago
PrajitR / NeuralStacksQueues
Implementations of differentiable stacks, queues, and deques from "Learning to Transduce with Unbounded Memory"
☆20 · Sep 8, 2015 · Updated 10 years ago
yobibyte / iclr-viewer
Go through the list of accepted papers for ICLR in terminal and add them to your reading list.
☆13 · Jan 30, 2021 · Updated 5 years ago
chriscremer / Inference-Suboptimality
Code for 'Inference Suboptimality in Variational Autoencoders'
☆11 · May 22, 2020 · Updated 6 years ago
imasmitja / stalker
This is a ROS repository to track an underwater target using a Particle Filter range-only method and the SparusII AUV
☆11 · Nov 27, 2024 · Updated last year
jhejna / morphology-opt
Code for the paper Task Agnostic Morphology Evolution.
☆21 · May 25, 2021 · Updated 5 years ago
yobibyte / unitary-scalarization-dmtl
In Defense of the Unitary Scalarization for Deep Multi-Task Learning
☆22 · Mar 8, 2023 · Updated 3 years ago
zuoxingdong / dm2gym
Convert DeepMind Control Suite to OpenAI gym environments.
☆87 · Jan 31, 2020 · Updated 6 years ago
young-geng / JaxCQL
Conservative Q learning in Jax
☆58 · Feb 7, 2023 · Updated 3 years ago
Wnight963 / UAV_Optim_Pytorch
☆10 · Apr 7, 2021 · Updated 5 years ago
ying-wen / gr2
Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
☆14 · Dec 8, 2022 · Updated 3 years ago
cgrivera / ai-arena
The AI Arena: A framework for distributed multi-agent reinforcement learning
☆14 · Aug 5, 2022 · Updated 3 years ago
ethanluoyc / e2c-pytorch
E2C implementation in PyTorch
☆43 · Jul 5, 2017 · Updated 9 years ago
wiseodd / compound-density-networks
Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…
☆16 · May 26, 2022 · Updated 4 years ago
itaicaspi / inception-v4.torch
GoogLeNet Inception architecture v4 implementation on torch
☆11 · Mar 18, 2016 · Updated 10 years ago
aws-deepracer / aws-deepracer-notebooks
Provides a jailbreak experience of AWS DeepRacer, giving us more control over the training/simulation process and RL algorithm tuning
☆18 · Feb 17, 2023 · Updated 3 years ago