philipjball/OffCon3



📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)

☆25

Alternatives and similar repositories for OffCon3

Users that are interested in OffCon3 are comparing it to the libraries listed below.


vikashplus / unitree_sim
MuJoCo models for Unitree Robots
☆12 · Nov 24, 2021 · Updated 4 years ago
vwxyzjn / a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
☆23 · May 20, 2022 · Updated 4 years ago
robintyh1 / icml2021-pengqlambda
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15 · Jul 23, 2021 · Updated 4 years ago
lanyavik / BAIL
☆18 · Jul 13, 2022 · Updated 4 years ago
juliuskunze / cwvae-jax
Clockwork VAEs in JAX/Flax
☆32 · Jul 16, 2021 · Updated 5 years ago
KyriacosShiarli / taco
☆25 · Jan 2, 2019 · Updated 7 years ago
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆30 · Dec 14, 2021 · Updated 4 years ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆57 · Apr 26, 2026 · Updated 2 months ago
chandar-lab / Lifelong-Hanabi
A Continual Multi-agent RL testbed based on Hanabi
☆31 · Aug 1, 2021 · Updated 4 years ago
ikostrikov / dmcgym
☆23 · Aug 19, 2022 · Updated 3 years ago
vwxyzjn / gym-pysc2
Gym wrapper for pysc2
☆10 · Sep 16, 2022 · Updated 3 years ago
rmrafailov / LOMPO
Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models
☆31 · Apr 30, 2021 · Updated 5 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆76 · Jun 9, 2021 · Updated 5 years ago
jetnew / visrl
A simple wrapper to analyse and visualise reinforcement learning agents' behaviour in the environment.
☆14 · Jan 8, 2022 · Updated 4 years ago
ucl-dark / paired
PAIRED in PyTorch 🔥
☆65 · Mar 8, 2023 · Updated 3 years ago
philipjball / ReadyPolicyOne
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18 · Jul 6, 2023 · Updated 3 years ago
RyanNavillus / reward-surfaces
☆19 · Apr 22, 2024 · Updated 2 years ago
ericmedvet / 2dhmsr
Java framework for experimenting with a 2-D version of the voxel-based soft robots.
☆20 · Mar 31, 2023 · Updated 3 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 · Jun 24, 2020 · Updated 6 years ago
xingchenwan / bgpbt
[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)
☆31 · Sep 16, 2022 · Updated 3 years ago
avelazquez15 / DPM
Dynamic Power Management using Reinforcement Learning for IoT devices.
☆11 · Oct 23, 2021 · Updated 4 years ago
uber-research / Evolvability-ES
☆14 · Jun 26, 2019 · Updated 7 years ago
Bam4d / Neural-Game-Engine
Code to reproduce Neural Game Engine experiments and pre-trained models
☆41 · Jun 22, 2022 · Updated 4 years ago
rcao-hk / td3_her_rlbench_reacher
An implementation for solving the reach-target task based on TD3 with HER using PaddlePaddle.
☆12 · Aug 10, 2020 · Updated 5 years ago
voot-t / guide-actor-critic
Keras implementation of guide actor-critic for continuous control
☆11 · Mar 12, 2018 · Updated 8 years ago
KevinKaiwenYu / DMAB-for-dynamic-beamforming-with-local-observation
Main data and model for submitted paper: Energy-Efficient Multi-Cell Beamforming via Multi-Agent Reinforcement Learning
☆12 · Jan 24, 2024 · Updated 2 years ago
WPI-MMR / gym_solo
A custom OpenAI Gym environment for solo experimentation.
☆12 · Apr 14, 2021 · Updated 5 years ago
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆32 · Sep 7, 2021 · Updated 4 years ago
eager-dev / eagerx_tutorials
Tutorials on how to use EAGERx
☆16 · Aug 14, 2025 · Updated 11 months ago
evgenii-nikishin / omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
☆43 · Jun 14, 2021 · Updated 5 years ago
bmazoure / ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆62 · Aug 4, 2022 · Updated 3 years ago
sheydashz / federated-double-deep-Q_network-
A framework that exploits the potentials of distributed federated learning and double deep Q-networks to minimize joint energy and delay …
☆11 · Apr 21, 2021 · Updated 5 years ago
PierreExeter / rl_reach
RL Reach is a platform for running reproducible reinforcement learning experiments.
☆46 · Jan 31, 2026 · Updated 5 months ago
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25 · Jul 18, 2023 · Updated 3 years ago
montrealrobotics / unsupervised-adr
Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL
☆12 · Aug 4, 2020 · Updated 5 years ago
denisyarats / drq
DrQ: Data regularized Q
☆422 · Jan 13, 2023 · Updated 3 years ago
philipjball / TD3_PyTorch
♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation
☆10 · Jun 20, 2021 · Updated 5 years ago
kevinzakka / dm_env_wrappers
Standalone library of frequently-used wrappers for dm_env environments.
☆19 · Jul 9, 2024 · Updated 2 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21 · Feb 24, 2023 · Updated 3 years ago