Policy gradient reinforcement learning algorithm with importance sampling
☆33Oct 6, 2017Updated 8 years ago
Alternatives and similar repositories for policy-gradient-importance-sampling
Users that are interested in policy-gradient-importance-sampling are comparing it to the libraries listed below
Sorting:
- ☆11Sep 1, 2017Updated 8 years ago
- Improved Training of Wasserstein GANs for Neural Machine Translation☆11Dec 11, 2017Updated 8 years ago
- weekly reinforcement learning paper reviews☆33Jan 8, 2018Updated 8 years ago
- Reinforcement Leanring for Tetris☆19Oct 24, 2016Updated 9 years ago
- Getting Starting with NIMBUS-CORE☆10Dec 16, 2023Updated 2 years ago
- Code for the paper "Curriculum Dropout", ICCV 2017☆26May 2, 2018Updated 7 years ago
- PyTorch Implementation of REINFORCE for both discrete & continuous control☆266Apr 16, 2017Updated 8 years ago
- implementation of SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient☆33Feb 4, 2017Updated 9 years ago
- Demos and tutorials around Torch7 (in progress updates from xLearn)☆43Feb 15, 2017Updated 9 years ago
- Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer?☆11Jan 3, 2019Updated 7 years ago
- ☆38Mar 6, 2017Updated 9 years ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)☆11Sep 14, 2017Updated 8 years ago
- A Python implementation of the Viterbi Algorithm with Bigram Hidden Markov Model(HMM) taggers for predicting Parts of Speech(POS) tags. -…☆12Feb 9, 2016Updated 10 years ago
- (TG'2023) Official code for the paper "Revisiting of AlphaStar" (previously called "Rethinking of AlphaStar"). It compares the raw interf…☆10Sep 6, 2021Updated 4 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Dec 6, 2013Updated 12 years ago
- Convolutional Sequence-to-Sequence (Work in Progress)☆10May 22, 2017Updated 8 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- This simulator models multi core systems, intended primarily for studies on main memory management techniques. It models a trace-based ou…☆12Jan 18, 2016Updated 10 years ago
- Crow Middleware for Configuring HTTP Security Headers☆10Sep 30, 2015Updated 10 years ago
- Implementation of Receding Horizon Curiosity Algrithm☆13Mar 24, 2023Updated 2 years ago
- Quicksilver superpage management system☆11May 14, 2021Updated 4 years ago
- rust-libp2p examples☆11Dec 28, 2022Updated 3 years ago
- ☆11Jul 9, 2023Updated 2 years ago
- Labeled sentences from IMDb movie reviews☆10Jul 10, 2017Updated 8 years ago
- Code for the paper "Learning Step-Size Adaptation in CMA-ES"☆12Mar 24, 2023Updated 2 years ago
- ☆32Jan 30, 2026Updated last month
- Variational Bayes for NN in Torch7 (http://papers.nips.cc/paper/4329-practical-variational-inference-for-neural-networks.pdf)☆10Mar 23, 2015Updated 10 years ago
- Prototype code for paper: Adversarial Generalized Method of Moments, Greg Lewis and Vasilis Syrgkanis☆12Oct 21, 2020Updated 5 years ago
- Sign in with Ethereum for Next.js☆17Nov 6, 2022Updated 3 years ago
- ☆13Jul 10, 2020Updated 5 years ago
- Validation Generation for Kubeflow CRD on Kubernetes☆11Jan 25, 2021Updated 5 years ago
- docker-compose example with loadbalancing☆15Apr 18, 2015Updated 10 years ago
- A set of algorithms and environments to train SafeRL agents, written in TensorFlow2 and OpenAI Gym.☆12Jul 26, 2022Updated 3 years ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 5 years ago
- MADDPG agent with collaboration and competition☆12Nov 9, 2018Updated 7 years ago
- ☆12May 12, 2016Updated 9 years ago
- Telstra Kaggle Competition☆10Mar 1, 2016Updated 10 years ago
- ☆11Mar 30, 2016Updated 9 years ago
- Implement BinaryNet of CNN with chainer☆11May 5, 2016Updated 9 years ago