robintyh1/neurips2021-meta-gradient-offpolicy-evaluation

Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021

☆13

Alternatives and similar repositories for neurips2021-meta-gradient-offpolicy-evaluation

Users who are interested in neurips2021-meta-gradient-offpolicy-evaluation are comparing it to the libraries listed below.

waterhorse1 / NAC
(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.
☆28 · Nov 19, 2021 · Updated 4 years ago
godmoves / reinforcement_learning_collections
A collection of deep reinforcement learning algorithm implementations
☆11 · Jan 9, 2020 · Updated 6 years ago
uber-research / Evolvability-ES
☆14 · Jun 26, 2019 · Updated 7 years ago
adinarad / IK-3LinkManipulator-NN
Solving the inverse kinematics problem of a 3 Link Planar Manipulator using neural networks.
☆10 · Jul 19, 2020 · Updated 6 years ago
RobvanGastel / meta-rl-algorithms
A collection of Meta-Reinforcement Learning algorithms in PyTorch
☆51 · Jul 16, 2024 · Updated 2 years ago
j3soon / dfac
[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
☆31 · Jun 1, 2023 · Updated 3 years ago
anair13 / bullet-manipulation-affordances
☆13 · Jun 3, 2022 · Updated 4 years ago
menglinjian / Deep-FTRL-ORW
Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…
☆11 · Dec 1, 2022 · Updated 3 years ago
yifanycc / AdaZeta
[EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…
☆13 · Dec 15, 2024 · Updated last year
guojiapub / BiQUE
☆11 · Apr 26, 2023 · Updated 3 years ago
moorecsys / moorecsys.github.io
Tutorial on Multi-Objective Recommender Systems @ KDD 2021
☆19 · Dec 4, 2022 · Updated 3 years ago
pritismankar-maan / Control-of-Robotics-ARM-using-PID-ANN-Fuzzy-controllers
Comparison between a PID, ANN & Fuzzy-PID controller while controlling a Robotic Arm in Simulink
☆11 · Jan 31, 2022 · Updated 4 years ago
npvoid / OnlineDoubleOracle
☆10 · Apr 23, 2021 · Updated 5 years ago
harris-chris / joint-shapley-values
Source code for the Joint Shapley values: a measure of joint feature importance
☆12 · Sep 14, 2021 · Updated 4 years ago
icaros-usc / dqd-rl
Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"
☆22 · Oct 3, 2022 · Updated 3 years ago
sail-sg / rosmo
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
☆30 · Jul 18, 2023 · Updated 3 years ago
mdurmuss / boa
Butterfly Optimization Algorithm for Weapon Target Assignment Problem
☆12 · Jul 24, 2021 · Updated 5 years ago
hpi-sam / GNN-SpaceTimeGraphs
Graph Neural Networks utilization for Spatiotemporal graphs. These methods will be applied into the problem of forecasting traffic flow o…
☆26 · Mar 22, 2021 · Updated 5 years ago
louiskirsch / metagenrl
MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…
☆71 · Jun 5, 2020 · Updated 6 years ago
YanklQYXing / Lottery
A simple marquee-style lottery demo built with react-native, using react-navigation; supports custom prize names, draw timing, and more
☆19 · Dec 18, 2017 · Updated 8 years ago
Rondorf / BOReL
Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 20…
☆31 · Nov 23, 2021 · Updated 4 years ago
anassinator / pddp
WIP implementation of Probabilistic Differential Dynamic Programming in PyTorch
☆16 · Jul 25, 2024 · Updated 2 years ago
miguelriemoliveira / RansacPlaneDetection
A python API for plane detection in point clouds
☆13 · Apr 22, 2021 · Updated 5 years ago
beta-robots / gocator_3100
Point cloud capture of Gocator3100 device
☆13 · Dec 20, 2016 · Updated 9 years ago
guaguakai / decision-focused-RL
☆16 · Nov 4, 2021 · Updated 4 years ago
ondrejbohdal / evograd
Official PyTorch implementation of "EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization"
☆23 · Oct 24, 2021 · Updated 4 years ago
chh105 / MetaPlanner
MetaPlanner is an open source automated treatment planning method that performs meta-optimization of treatment planning hyperparameters. …
☆14 · Nov 7, 2023 · Updated 2 years ago
venezia-antonio / Trajectory-Planning-for-ABB-IRB140
Trajectory planning for ABB IRB140 industrial manipulator using MATLAB and CoppeliaSim (V-REP). The manipulator's task is to spill a bottle …
☆14 · Feb 8, 2021 · Updated 5 years ago
chagmgang / pysc2_rl
☆10 · Jul 14, 2018 · Updated 8 years ago
Snow-Dancing / ReinforcementLearning
Uses reinforcement learning methods (Q-value iteration, Q-learning, and SARSA) to solve the mountain car and inverted pendulum control problems
☆15 · Jul 25, 2019 · Updated 7 years ago
baggepinnen / TrajectoryLimiters.jl
Ruckig and nonlinear filters to create dynamically feasible reference trajectories
☆23 · Jul 4, 2026 · Updated 3 weeks ago
wyjung0625 / p3s
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
☆22 · Jan 9, 2020 · Updated 6 years ago
aryandeshwal / BOPS
Python implementation of Bayesian optimization over permutation spaces.
☆20 · Feb 27, 2022 · Updated 4 years ago
GT-RAIL / clamp
Combined Learning from Demonstration and Motion Planning
☆14 · Feb 5, 2019 · Updated 7 years ago
wupanhao / quadruped_simulation
quadruped simulation using unitree a1 in pybullet, controller code from stanford pupper
☆15 · May 19, 2021 · Updated 5 years ago
nwilliterate / RBF-based-control-for-robot-maniupulator
rbf network based control for robot manipulator
☆15 · Feb 21, 2022 · Updated 4 years ago
Nicny / MS-DWTA-by-MARL
Multi-agent reinforcement learning methods to solve the multi-ship dynamic weapon-target assignment problem
☆16 · Jul 25, 2024 · Updated 2 years ago
VITA-Group / L2O-Training-Techniques
[NeurIPS 2020 Spotlight Oral] "Training Stronger Baselines for Learning to Optimize", Tianlong Chen*, Weiyi Zhang*, Jingyang Zhou, Shiyu …
☆29 · Dec 30, 2021 · Updated 4 years ago
BotPlayers / BotPlayers
Play with agents and more.
☆22 · Sep 18, 2023 · Updated 2 years ago