astariul / encode-attend-navigate-pytorchLinks
Encode-attend-navigate unofficial Pytorch implementation
β11Updated 9 months ago
Alternatives and similar repositories for encode-attend-navigate-pytorch
Users that are interested in encode-attend-navigate-pytorch are comparing it to the libraries listed below
Sorting:
- π§© Create your own puzzle, use my agents to solve it π€ try them out! π§©β9Updated 3 years ago
- β25Updated 3 years ago
- Toy environment set for multi-agent reinforcement learning and moreβ39Updated 7 months ago
- β10Updated last week
- Code for TSP Transformerβ186Updated 4 years ago
- Population-Based Reinforcement Learning for Combinatorial Optimizationβ78Updated last year
- Official Implementation of Mementoβ18Updated 7 months ago
- β9Updated 4 years ago
- Minimal implementation of multi-agent reinforcement learning algorithmsβ56Updated 3 years ago
- Implementation of Pointer Networks using PyTorchβ62Updated last year
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.β18Updated 3 years ago
- Appendix repository for Medium article "Routing Traveling Salesmen on Random Graphs using Reinforcement Learning, inΒ PyTorch"β58Updated 5 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Verticesβ23Updated 4 years ago
- This is the source code for solving the Traveling Salesman Problems (TSP) using Monte Carlo tree search (MCTS).β33Updated 5 years ago
- Hierarchical deep reinforcement learning for combinatorial optimization problemβ35Updated 5 years ago
- Study NeuralUCB and regret analysis for contextual bandit with neural decisionβ95Updated 3 years ago
- β11Updated 2 years ago
- β122Updated 2 months ago
- Pointer NN differs from the previous attention attempts in that, instead of using attention to weight hidden units of an encoder, it usesβ¦β42Updated 4 years ago
- β55Updated 2 years ago
- AGAC: Adversarially Guided Actor-Criticβ48Updated 3 years ago
- β¨π² Hierarchical extreme multiclass and multi-label classification.β17Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β57Updated 2 years ago
- The source code for the paper: 'ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems'β85Updated 4 years ago
- Reinforcement Learning Methods with PyTorchβ39Updated 5 years ago
- Gym environment for playing Wordle with RL agentsβ39Updated 3 years ago
- JAX implementations of core Deep RL algorithmsβ82Updated 3 years ago
- β27Updated last year
- Tabular methods for reinforcement learningβ38Updated 5 years ago
- Episodic Policy Gradient Trainingβ14Updated 3 years ago