oxwhirl/opiq

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/oxwhirl/opiq)

oxwhirl / opiq

Code for Optimistic Exploration even with a Pessimistic Initialisation

☆14

Alternatives and similar repositories for opiq

Users that are interested in opiq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mcmachado / count_based_exploration_sr
View on GitHub
☆31Jul 1, 2019Updated 7 years ago
nnaisense / MAX
View on GitHub
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆81Jul 23, 2019Updated 7 years ago
bonniesjli / DQN_SR
View on GitHub
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Jun 19, 2019Updated 7 years ago
jinnaiyuu / Optimal-Options-ICML-2019
View on GitHub
Code for generating options for planning and reinforcement learning
☆12Feb 18, 2021Updated 5 years ago
pkumusic / E-DRL
View on GitHub
Exploration Strategies for Deep Reinforcement Learning
☆39Oct 31, 2018Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yfletberliac / adversarially-guided-actor-critic
View on GitHub
AGAC: Adversarially Guided Actor-Critic
☆47Sep 16, 2021Updated 4 years ago
rasoolfa / P3O
View on GitHub
P3O paper code
☆30Aug 7, 2019Updated 6 years ago
boschresearch / DD_OPG
View on GitHub
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11Jun 12, 2019Updated 7 years ago
tung-nd / cwbc
View on GitHub
☆11Oct 3, 2022Updated 3 years ago
voot-t / guide-actor-critic
View on GitHub
Keras implementation of guide actor-critic for continuous control
☆11Mar 12, 2018Updated 8 years ago
dnishio / DSAC
View on GitHub
The implementation of Discriminator Soft Actor Critic
☆15Jan 25, 2020Updated 6 years ago
drivingstereo-dataset / drivingstereo-dataset.github.io
View on GitHub
A Large-Scale Dataset for Stereo Matching in Autonomous Driving Scenarios
☆10Oct 25, 2021Updated 4 years ago
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
jbuckman / dmdp-donutworld
View on GitHub
☆13Jul 25, 2019Updated 7 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
TARTRL / TiZero
View on GitHub
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体
☆14May 25, 2023Updated 3 years ago
tung-nd / ExPT
View on GitHub
☆19Mar 31, 2024Updated 2 years ago
jvmncs / ParamNoise
View on GitHub
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30Mar 14, 2019Updated 7 years ago
facebookresearch / impact-driven-exploration
View on GitHub
impact-driven-exploration
☆136Oct 3, 2023Updated 2 years ago
tedmoskovitz / TOP
View on GitHub
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25Jul 18, 2023Updated 3 years ago
yobibyte / pgn
View on GitHub
Graph Nets in pytorch
☆28Dec 8, 2022Updated 3 years ago
thsunkid / Understanding-CNN-and-Neural-Style-Transfer
View on GitHub
Visualizing CNN and Neural aesthetics
☆13Dec 12, 2018Updated 7 years ago
philipjball / TD3_PyTorch
View on GitHub
♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation
☆10Jun 20, 2021Updated 5 years ago
flowersteam / geppg
View on GitHub
☆36Aug 10, 2018Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
vluzko / dac-iclr-reproducibility
View on GitHub
ICLR Reproducibility Challenge for Discriminator-Actor-Critic
☆20Jan 7, 2019Updated 7 years ago
facebookresearch / reward-estimator-corl
View on GitHub
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆23Oct 26, 2018Updated 7 years ago
microsoft / oac-explore
View on GitHub
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆70Aug 11, 2023Updated 2 years ago
google-deepmind / dm_hard_eight
View on GitHub
☆85Nov 19, 2020Updated 5 years ago
uoe-agents / MATE
View on GitHub
Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning
☆15Apr 25, 2024Updated 2 years ago
dannysdeng / dqn-pytorch
View on GitHub
PyTorch - Implicit Quantile Networks - Quantile Regression - C51
☆22Jul 26, 2019Updated 7 years ago
google-research / pisac
View on GitHub
Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)
☆45Jun 8, 2023Updated 3 years ago
mingen-pan / Reinforcement-Learning-Q-learning-Gridworld-Pytorch
View on GitHub
This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld
☆14Jul 13, 2020Updated 6 years ago
igilitschenski / multi_car_racing
View on GitHub
An OpenAI Gym environment for multi-agent car racing based on Gym's original car racing environment.
☆91Feb 20, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
end3r / Gamepad-API-Content-Kit
View on GitHub
Gamepad API Content Kit
☆14Jun 1, 2016Updated 10 years ago
ucl-dark / skillhack
View on GitHub
SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning
☆17Oct 23, 2022Updated 3 years ago
VinAIResearch / PC3-pytorch
View on GitHub
Predictive Coding for Locally-Linear Control (ICML-2020)
☆18Jul 22, 2024Updated 2 years ago
lmb-freiburg / td-or-not-td
View on GitHub
Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…
☆12Aug 24, 2018Updated 7 years ago
YRussac / WeightedLinearBandits
View on GitHub
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17Nov 14, 2019Updated 6 years ago
Hyp-ed / hyped-2019
View on GitHub
SpaceX Hyperloop Competiton 2019
☆10Jul 30, 2019Updated 6 years ago
salinisenthil / ConnGO
View on GitHub
Workflow for CONNectivity preserving Geometry Optimization
☆11Sep 2, 2021Updated 4 years ago