wangbx66 / differentially-private-q-learning
☆13Updated 6 years ago
Alternatives and similar repositories for differentially-private-q-learning
Users that are interested in differentially-private-q-learning are comparing it to the libraries listed below
- ☆18Updated 4 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆56Updated 5 years ago
- Hao Jin, Yang Peng, Wenhao Yang, Shusen Wang and Zhihua Zhang. Federated Reinforcement Learning with Environment Heterogeneity. AISTATS, …☆63Updated 3 years ago
- ☆39Updated 3 years ago
- [NeurIPS 2020 Spotlight Oral] "Training Stronger Baselines for Learning to Optimize", Tianlong Chen*, Weiyi Zhang*, Jingyang Zhou, Shiyu …☆29Updated 4 years ago
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆50Updated 4 years ago
- Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356☆75Updated 5 years ago
- Implementation of "Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning" (https://arxiv.org/pdf/1712.08266.pdf)☆38Updated 7 years ago
- ☆42Updated 6 years ago
- Federated posterior averaging implemented in JAX☆53Updated 2 years ago
- Minimax Optimization, Stackelberg Games, Generative Adversarial Networks☆19Updated 5 years ago
- FEN Code☆40Updated 6 years ago
- This repository contains a simple implementation of Interval Bound Propagation (IBP) using TensorFlow: https://arxiv.org/abs/1810.12715☆161Updated 6 years ago
- Modular PyTorch implementation of policy gradient methods☆25Updated 7 years ago
- Code used in our paper "Robust Deep Reinforcement Learning through Adversarial Loss"☆33Updated 2 years ago
- ☆28Updated 5 years ago
- PyTorch implementation of efficient algorithms for DRO with CVaR and Chi-Square uncertainty sets☆64Updated 3 years ago
- Reinforcement Learning with Perturbed Reward, AAAI 2020☆30Updated last year
- Study NeuralUCB and regret analysis for contextual bandit with neural decision☆99Updated 4 years ago
- [ICLR 2021] "Learning a Minimax Optimizer: A Pilot Study" by Jiayi Shen*, Xiaohan Chen*, Howard Heaton*, Tianlong Chen, Jialin Liu, Wotao…☆15Updated 4 years ago
- Code for our paper on doing resource allocation with graph neural networks☆32Updated 4 years ago
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆139Updated 4 years ago
- [NeurIPS 2020, Spotlight] State-Adversarial DQN (SA-DQN) for robust deep reinforcement learning☆35Updated 4 years ago
- ☆22Updated 6 years ago
- ☆27Updated 2 years ago
- ☆16Updated 2 years ago
- ☆126Updated last year
- ☆12Updated 5 years ago
- LipSDP - Lipschitz Estimation for Neural Networks☆71Updated 3 years ago
- Variance Reduction for Reinforcement Learning in Input-Driven Environments (ICLR '19)☆31Updated 6 years ago