ryoungj/ZO-L2L

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ryoungj/ZO-L2L)

ryoungj / ZO-L2L

[ICLR'20] Learning to Learn by Zeroth-Order Oracle

☆14

Alternatives and similar repositories for ZO-L2L

Users that are interested in ZO-L2L are comparing it to the libraries listed below

Sorting:

wyjung0625 / p3s
View on GitHub
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
☆22Jan 9, 2020Updated 6 years ago
optimization-toolbox / DNE4py
View on GitHub
DNE4py is a python library that aims to run and visualize many different evolutionary algorithms with high performance using mpi4py. It a…
☆10Oct 13, 2020Updated 5 years ago
ryanxhr / BEAR
View on GitHub
Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11Oct 29, 2019Updated 6 years ago
uber-research / Evolvability-ES
View on GitHub
☆14Jun 26, 2019Updated 6 years ago
isl-org / LMRS
View on GitHub
Source code for ICLR 2020 paper: "Learning to Guide Random Search"
☆39Sep 2, 2024Updated last year
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
KyunghyunLee / aes-rl
View on GitHub
☆17Dec 12, 2020Updated 5 years ago
jparkerholder / DvD_ES
View on GitHub
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆46Oct 29, 2020Updated 5 years ago
behaviorguidedRL / BGRL
View on GitHub
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Jun 24, 2020Updated 5 years ago
dannysdeng / dqn-pytorch
View on GitHub
PyTorch - Implicit Quantile Networks - Quantile Regression - C51
☆22Jul 26, 2019Updated 6 years ago
pierresegonne / VINF
View on GitHub
Repository for DTU Special Course, focusing on Variational Inference using Normalizing Flows (VINF). Supervised by Michael Riis Andersen
☆26Jun 11, 2020Updated 5 years ago
jvmncs / ParamNoise
View on GitHub
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30Mar 14, 2019Updated 6 years ago
YyzHarry / SV-RL
View on GitHub
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34Feb 1, 2020Updated 6 years ago
mnicely / gtc_fall
View on GitHub
GPU Optimization for Python
☆10Mar 13, 2021Updated 4 years ago
ertsiger / induction-subgoal-automata-rl
View on GitHub
Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…
☆14Aug 15, 2023Updated 2 years ago
TiFu / truck_backer_upper
View on GitHub
☆10Jan 4, 2023Updated 3 years ago
icaros-usc / overcooked_env_gen
View on GitHub
Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021.
☆16May 1, 2024Updated last year
suzana-ilic / MLMath
View on GitHub
Compressed ML Math material based on the book "Mathematics for Machine Learning" and other resources.
☆10Jan 7, 2020Updated 6 years ago
Ktakuya332C / deepcube
View on GitHub
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆14Dec 9, 2018Updated 7 years ago
mossr / CrossEntropyVariants.jl
View on GitHub
Cross-entropy method variants for optimization in Julia
☆12Apr 29, 2021Updated 4 years ago
marcharper / pyed
View on GitHub
Computes trajectories for evolutionary dynamics.
☆15Oct 6, 2020Updated 5 years ago
sukunis / CUNFFT
View on GitHub
Nonequispaced FFTs on GPUs (based on NFFT: http://www.nfft.org)
☆11Apr 30, 2018Updated 7 years ago
mjamroz / PlantRecognition
View on GitHub
Example of android app written in Qt/Qml which uses MXNet for plant image recognition.
☆10Nov 4, 2017Updated 8 years ago
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 4 years ago
polixir / d3pe
View on GitHub
D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.
☆11Jun 2, 2022Updated 3 years ago
ericjang / variance_reduction
View on GitHub
Implementation of Variance Reduction Techniques in Julia
☆11Sep 6, 2016Updated 9 years ago
boschresearch / DD_OPG
View on GitHub
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11Jun 12, 2019Updated 6 years ago
automl / LTO-CMA
View on GitHub
Code for the paper "Learning Step-Size Adaptation in CMA-ES"
☆12Mar 24, 2023Updated 2 years ago
jpmattern / seir-covid19
View on GitHub
☆12Oct 11, 2022Updated 3 years ago
microsoft / EPPO
View on GitHub
An implementation of effective policy ensemble.
☆16Jul 5, 2023Updated 2 years ago
ezhan94 / calibratable-style-consistency
View on GitHub
☆11Jun 5, 2023Updated 2 years ago
zhougroup / IDAC
View on GitHub
Implicit Distributional Actor Critic
☆11Dec 8, 2021Updated 4 years ago
kaist-silab / design-baselines-fixes
View on GitHub
Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)
☆11Apr 30, 2023Updated 2 years ago
uncharted-technologies / robust-domain-randomization
View on GitHub
Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"
☆12Nov 22, 2022Updated 3 years ago
navyifanr / NdkSample
View on GitHub
some NDK sample
☆11Mar 11, 2018Updated 7 years ago
jinglingli / nn-extrapolate
View on GitHub
☆13Jan 4, 2023Updated 3 years ago
hsjharvey / Reinforcement-Learning
View on GitHub
Reinforcement learning algorithm implementation
☆10Oct 31, 2021Updated 4 years ago
qslim / epcb-gnns
View on GitHub
☆11Jun 21, 2022Updated 3 years ago
hsvgbkhgbv / Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learning
View on GitHub
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning
☆10Dec 10, 2018Updated 7 years ago