zoulixin93 / pseudo_dyna_qView external linksLinks
☆14Jun 6, 2020Updated 5 years ago
Alternatives and similar repositories for pseudo_dyna_q
Users that are interested in pseudo_dyna_q are comparing it to the libraries listed below
Sorting:
- ☆41Nov 16, 2022Updated 3 years ago
- ☆11Feb 22, 2019Updated 6 years ago
- Recommendation System using Deep Q-Networks and Double Deep Q-Networks☆13May 23, 2020Updated 5 years ago
- Implementation for our paper in NeurIPS 2019☆48Dec 18, 2019Updated 6 years ago
- Deep Reinforcement Learning for Movies Recommendation System☆83Jan 5, 2020Updated 6 years ago
- Sequential☆24Jun 3, 2022Updated 3 years ago
- MovieLens recommendation system using reinforcement learning (GYM + PPO)☆50Jul 8, 2020Updated 5 years ago
- ☆31Apr 21, 2021Updated 4 years ago
- Tensorflow implementation for "Generative Adversarial User Model forReinforcement Learning Based Recommendation System"☆131Sep 10, 2019Updated 6 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆35Oct 22, 2020Updated 5 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- In this project, we give python and C++ codes for the Ring Polymer Molecular Dynamics (RMPD) to calculate the time correlation function(…☆12Dec 31, 2017Updated 8 years ago
- This is a DQN-based recommendation system for item-list recommendation and it finally achieved second place in the competition of RL-base…☆11Oct 8, 2021Updated 4 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- Visualize subsets of PSL(2,R) in exterior solid torus model☆13Aug 20, 2020Updated 5 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- Simple RNN test with TensorFLow☆13May 22, 2018Updated 7 years ago
- Open AI Gym environment of the Missile Command Atari game.☆14May 23, 2023Updated 2 years ago
- Patient data simulator following the structure of an open-ai gym.☆11Jul 9, 2019Updated 6 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 4 years ago
- ☆10Feb 2, 2023Updated 3 years ago
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning☆10Dec 8, 2022Updated 3 years ago
- Our first-year mathematics graduate school notes☆10Dec 20, 2021Updated 4 years ago
- gridslam from OpenSLAM.org☆13May 15, 2018Updated 7 years ago
- ☆10Jan 29, 2021Updated 5 years ago
- This is a tutorial of using Kubeflow to build model, train model and deploy model serving.☆14Nov 22, 2022Updated 3 years ago
- Code associated with the project http://predimportance.mit.edu/☆12Aug 7, 2020Updated 5 years ago
- Spectral Alignment of Graphs☆11Mar 26, 2017Updated 8 years ago
- Solving the card game 6 nimmt! with reinforcement learning☆14Dec 31, 2021Updated 4 years ago
- ☆11Dec 23, 2025Updated last month
- python pytorch……note☆10Nov 16, 2018Updated 7 years ago
- Explore Fibonacci, Galois, and State Space Linear Feedback Shift Register (LFSR) sequence generators☆12Dec 29, 2020Updated 5 years ago
- CaDiCaL + neural glue variable predictions☆10Oct 21, 2020Updated 5 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- ☆12May 2, 2022Updated 3 years ago
- A pytorch implementation of A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation.☆40Nov 26, 2019Updated 6 years ago
- An unofficial pytorch implementation of MELU☆46Aug 7, 2024Updated last year
- A full fledged mistral+wandb☆13Aug 16, 2024Updated last year