AutumnWu/Streamlined-Off-Policy-Learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AutumnWu/Streamlined-Off-Policy-Learning)

AutumnWu / Streamlined-Off-Policy-Learning

ICRL 2020

☆20

Alternatives and similar repositories for Streamlined-Off-Policy-Learning

Users that are interested in Streamlined-Off-Policy-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lanyavik / BAIL
View on GitHub
☆18Jul 13, 2022Updated 4 years ago
xuanlinli17 / iclr2021_rlreg
View on GitHub
Regularization Matters in Policy Optimization
☆21Nov 1, 2021Updated 4 years ago
nnaisense / MAGE
View on GitHub
Learning Action-Value Gradients in Model-based Policy Optimization
☆32Sep 7, 2021Updated 4 years ago
Ji4chenLi / Multi-Task-Batch-RL
View on GitHub
☆26Mar 16, 2023Updated 3 years ago
zzyunzhi / asynch-mb
View on GitHub
(CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning
☆14Dec 27, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
richardrl / fetch-block-construction
View on GitHub
Environment codebase for ICRA 2020 paper "Towards Practical Multi-object Manipulation using Relational Reinforcement Learning"
☆14Jul 22, 2020Updated 6 years ago
uber-research / D3G
View on GitHub
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
da-molchanov / variance-networks
View on GitHub
Variance Networks: When Expectation Does Not Meet Your Expectations, ICLR 2019
☆39Jan 31, 2020Updated 6 years ago
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
rlseminar / rlseminar.github.io
View on GitHub
Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.
☆21Nov 17, 2023Updated 2 years ago
dyne-submission / dynamics-aware-embeddings
View on GitHub
☆16Sep 25, 2019Updated 6 years ago
watchernyu / REDQ
View on GitHub
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆185Nov 14, 2024Updated last year
tmoer / multimodal_varinf
View on GitHub
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆35May 24, 2018Updated 8 years ago
watchernyu / setup-mujoco-gym-for-DRL
View on GitHub
Guide on how to set up openai gym and mujoco for deep reinforcement learning research.
☆16Jan 12, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
boschresearch / DD_OPG
View on GitHub
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11Jun 12, 2019Updated 7 years ago
xuzhiyuan1528 / KTM-DRL
View on GitHub
☆20Oct 23, 2020Updated 5 years ago
facebookresearch / reward-estimator-corl
View on GitHub
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆23Oct 26, 2018Updated 7 years ago
jxu43 / replication-mbpo
View on GitHub
NeurIPS Reproducibility Challenge 2019
☆21Feb 25, 2020Updated 6 years ago
singhalrk / stein_ksd
View on GitHub
☆10Apr 2, 2018Updated 8 years ago
Santara / stochastic_value_gradient
View on GitHub
Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]
☆25Jan 15, 2022Updated 4 years ago
MiscellaneousStuff / LeagueSandbox-RL-Learning
View on GitHub
Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo…
☆13Feb 23, 2025Updated last year
tedmoskovitz / WNPG
View on GitHub
implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies
☆13Mar 9, 2021Updated 5 years ago
vllm-project / tml-fa4
View on GitHub
FA4-based Relative Attention Kernel developed by TML and Colfax
☆17Updated this week
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
RonanFR / UCRL
View on GitHub
☆27May 17, 2019Updated 7 years ago
vaishak2future / sac
View on GitHub
Implementation of Soft Actor Critic
☆37Aug 27, 2021Updated 4 years ago
DrJimFan / SECANT
View on GitHub
[ICML 2021] Official code for SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
☆54Jul 5, 2023Updated 3 years ago
Baichenjia / Contrastive-UCB
View on GitHub
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
☆12Jun 16, 2022Updated 4 years ago
qlan3 / Explorer
View on GitHub
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆98Updated this week
SanderJSA / Pomodoro
View on GitHub
A simple CLI pomodoro timer written in Rust.
☆15Nov 17, 2023Updated 2 years ago
MichaelArbel / KWNG
View on GitHub
A Pytorch implementation of the KWNG estimator
☆14Jul 25, 2024Updated last year
SukerZ / The-PyTorch-Self-Driving-Experiment-by-DDPG-on-TORCS
View on GitHub
用PyTorch重构流传最广的Keras、TensorFlow做的TORCS实验。训练DDPG模型。
☆12Dec 23, 2018Updated 7 years ago
microsoft / oac-explore
View on GitHub
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
☆70Aug 11, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
justinjfu / diagnosing_qlearning
View on GitHub
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
☆17May 14, 2019Updated 7 years ago
haotian-liu / transformers_llava
View on GitHub
☆16Apr 28, 2023Updated 3 years ago
thiagopbueno / tf-mdp
View on GitHub
Probabilistic planning in continuous state-action MDPs in TensorFlow.
☆13Jun 21, 2022Updated 4 years ago
rcheng805 / CORE-RL
View on GitHub
Code implementing the CORE-RL algorithm with DDPG, PPO, and TRPO. See the paper "Control Regularization for Reduced Variance Reinforcemen…
☆32Jan 7, 2021Updated 5 years ago
jvmncs / ParamNoise
View on GitHub
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30Mar 14, 2019Updated 7 years ago
flatwhatson / doom.d
View on GitHub
No Rest for the Living
☆13Nov 13, 2022Updated 3 years ago
philipjball / OffCon3
View on GitHub
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25Jun 20, 2021Updated 5 years ago