A simple and easy-to-use implementation of the soft actor-critic algorithm.
☆15 · Sep 2, 2022 · Updated 3 years ago
Alternatives and similar repositories for SimpleSAC
Users who are interested in SimpleSAC are comparing it to the libraries listed below.
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult… ☆32 · Oct 26, 2022 · Updated 3 years ago
- Variational Reinforcement Learning ☆17 · Jul 25, 2024 · Updated last year
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation ☆38 · Feb 19, 2022 · Updated 4 years ago
- MuJoCo models for Unitree Robots ☆12 · Nov 24, 2021 · Updated 4 years ago
- Offline RL experiments ☆15 · Oct 1, 2022 · Updated 3 years ago
- Training tiny models to prove hard theorems ☆59 · Mar 5, 2026 · Updated 2 weeks ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers. ☆18 · May 14, 2019 · Updated 6 years ago
- ☆18 · Mar 10, 2026 · Updated last week
- My Body Is A Cage ☆41 · Apr 13, 2021 · Updated 4 years ago
- ☆19 · Jun 5, 2018 · Updated 7 years ago
- ☆24 · Feb 16, 2022 · Updated 4 years ago
- Conservative Q Learning on top of SAC ☆138 · Oct 15, 2022 · Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools ☆18 · Nov 22, 2022 · Updated 3 years ago
- PyTorch implementation of AVF ☆45 · Sep 2, 2020 · Updated 5 years ago
- Efficient joint input optimization and inference with DEQ ☆10 · Nov 25, 2021 · Updated 4 years ago
- This repository is the official implementation of Bidirectional Learning for Offline Infinite-width Model-based Optimization (NeurIPS 202… ☆14 · Jan 19, 2023 · Updated 3 years ago
- Code to reproduce experiments from: ☆10 · Dec 11, 2020 · Updated 5 years ago
- Standalone library of frequently-used wrappers for dm_env environments. ☆19 · Jul 9, 2024 · Updated last year
- ☆53 · Jan 20, 2023 · Updated 3 years ago
- This is a ROS repository to track an underwater target using a Particle Filter range-only method and the SparusII AUV ☆11 · Nov 27, 2024 · Updated last year
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou… ☆32 · Apr 20, 2024 · Updated last year
- ☆23 · Aug 19, 2022 · Updated 3 years ago
- Code for the paper Task Agnostic Morphology Evolution. ☆20 · May 25, 2021 · Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆44 · Jun 14, 2021 · Updated 4 years ago
- Master's thesis code: deep reinforcement learning ☆10 · Apr 4, 2020 · Updated 5 years ago
- Go through the list of accepted papers for ICLR in terminal and add them to your reading list. ☆13 · Jan 30, 2021 · Updated 5 years ago
- Related paper: Online Scheduling for Energy Minimization in Wireless Powered Mobile Edge Computing ☆10 · Jan 5, 2023 · Updated 3 years ago
- Implementations of differentiable stacks, queues, and deques from "Learning to Transduce with Unbounded Memory" ☆20 · Sep 8, 2015 · Updated 10 years ago
- Code for 'Inference Suboptimality in Variational Autoencoders' ☆10 · May 22, 2020 · Updated 5 years ago
- Conservative Q learning in Jax ☆57 · Feb 7, 2023 · Updated 3 years ago
- In Defense of the Unitary Scalarization for Deep Multi-Task Learning ☆21 · Mar 8, 2023 · Updated 3 years ago
- Convert DeepMind Control Suite to OpenAI gym environments. ☆87 · Jan 31, 2020 · Updated 6 years ago
- ☆10 · Apr 7, 2021 · Updated 4 years ago
- E2C implementation in PyTorch ☆43 · Jul 5, 2017 · Updated 8 years ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)… ☆16 · May 26, 2022 · Updated 3 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning ☆14 · Aug 5, 2022 · Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC). ☆106 · Jun 9, 2020 · Updated 5 years ago
- This is the dataset generation code for ADEPT (Approximate Derenderer, Extended Physics, and Tracking). http://physadept.csail.mit.edu/ ☆15 · Sep 26, 2022 · Updated 3 years ago
- Provides a jailbreak experience of AWS DeepRacer, giving us more control over the training/simulation process and RL algorithm tuning ☆18 · Feb 17, 2023 · Updated 3 years ago