Pytorch implementation of Soft Actor-Critic
☆20Apr 13, 2020Updated 6 years ago
Alternatives and similar repositories for SAC
Users that are interested in SAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 现在好用的能同步的网盘都没有了,于是自己用阿里云的OSS撸了一个☆10Apr 22, 2018Updated 8 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- ☆12Jun 17, 2022Updated 3 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆361Feb 3, 2020Updated 6 years ago
- KERL: reinforcement learning algorithms and tools implemented using Keras☆11Aug 2, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Qt-like event loops, signals and slots for communication across threads and processes in Python☆14Mar 26, 2024Updated 2 years ago
- ☆19Jun 25, 2023Updated 2 years ago
- TF2 Implementation of the Soft Actor-Critic Algorithm☆43Dec 8, 2022Updated 3 years ago
- This repository contains code examples for the paper "Learning to sequence and blend robotics skills via differentiable optimization".☆13Sep 11, 2022Updated 3 years ago
- For TAMP experiments using Drake☆13Jun 4, 2024Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- Simple change of a3c to a2c☆15Jun 18, 2017Updated 8 years ago
- Learning Continuous Control in Deep Reinforcement Learning☆14Nov 24, 2018Updated 7 years ago
- Contact Planning for Object Manipulation via Monte Carlo Tree Search☆14May 13, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Deep Reinforcement Learning DQN on Unity ML Agent☆11Sep 2, 2018Updated 7 years ago
- ☆16Feb 26, 2019Updated 7 years ago
- A Julia package for consensus-based optimisation☆16Updated this week
- Code for experimenting with state and action abstractions in reinforcement learning.☆29Dec 11, 2020Updated 5 years ago
- Example of android app written in Qt/Qml which uses MXNet for plant image recognition.☆10Nov 4, 2017Updated 8 years ago
- Summary for MPC 2017 at ETH Zürich☆12Oct 24, 2018Updated 7 years ago
- Official implementation of MacroRank: Ranking Macro Placement Solutions Leveraging Translation Equivariancy (ASP-DAC 2023)☆18Jun 3, 2023Updated 3 years ago
- An PyTorch implementation of "Importance Weighted Actor-Learner Architectures" https://arxiv.org/abs/1802.01561☆12Jan 6, 2021Updated 5 years ago
- Pytorch implimentation of the paper: "Deep Visual Constraints: Neural Implicit Models for Manipulation Planning from Visual Input"☆18Dec 23, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of the TD3 algorithm written in Pytorch☆12Dec 8, 2022Updated 3 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 5 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆94Jul 25, 2024Updated last year
- discrete soft Q learning(SQL) and soft Q imitation learning(SQIL) implementation in pytorch, simple!☆57Oct 18, 2022Updated 3 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- ☆13Feb 5, 2025Updated last year
- ☆18Mar 19, 2019Updated 7 years ago
- This is a mirror of the PDDL4J project on SourceForge. PDDL4J is an open source library to facilitate java implementation of planners bas…☆18Sep 27, 2012Updated 13 years ago
- Translation and understanding of the Pop-art paper.☆18Oct 21, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆14May 31, 2023Updated 3 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Dec 1, 2019Updated 6 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning☆12Dec 20, 2020Updated 5 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- Code for☆15Oct 16, 2020Updated 5 years ago
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization☆13Aug 3, 2024Updated last year