astariul / encode-attend-navigate-pytorchLinks

Encode-attend-navigate unofficial Pytorch implementation

☆11

Alternatives and similar repositories for encode-attend-navigate-pytorch

Users that are interested in encode-attend-navigate-pytorch are comparing it to the libraries listed below

Sorting:

AmineZouitine / RL_Puzzle
🧩 Create your own puzzle, use my agents to solve it 🤖 try them out! 🧩
☆9Updated 3 years ago
lyeskhalil / CORL
☆25Updated 3 years ago
MehdiZouitine / gym_ma_toy
Toy environment set for multi-agent reinforcement learning and more
☆39Updated 7 months ago
ZHANG-NI / AGFN
☆10Updated last week
xbresson / TSP_Transformer
Code for TSP Transformer
☆186Updated 4 years ago
instadeepai / poppy
Population-Based Reinforcement Learning for Combinatorial Optimization
☆78Updated last year
instadeepai / memento
Official Implementation of Memento
☆18Updated 7 months ago
wz26 / OpenGraphGym
☆9Updated 4 years ago
koulanurag / minimal-marl
Minimal implementation of multi-agent reinforcement learning algorithms
☆56Updated 3 years ago
ast0414 / pointer-networks-pytorch
Implementation of Pointer Networks using PyTorch
☆62Updated last year
clvrai / agile
Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.
☆18Updated 3 years ago
unit8co / medium-tsp
Appendix repository for Medium article "Routing Traveling Salesmen on Random Graphs using Reinforcement Learning, in PyTorch"
☆58Updated 5 years ago
atavakol / action-hypergraph-networks
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23Updated 4 years ago
Spider-scnu / Monte-Carlo-tree-search-for-TSP
This is the source code for solving the Traveling Salesman Problems (TSP) using Monte Carlo tree search (MCTS).
☆33Updated 5 years ago
qiang-ma / HRL-for-combinatorial-optimization
Hierarchical deep reinforcement learning for combinatorial optimization problem
☆35Updated 5 years ago
sauxpa / neural_exploration
Study NeuralUCB and regret analysis for contextual bandit with neural decision
☆95Updated 3 years ago
ha0ransun / Path-Auxiliary-Sampler
☆11Updated 2 years ago
martyput / MDP_book
☆122Updated 2 months ago
Guillem96 / pointer-nn-pytorch
Pointer NN differs from the previous attention attempts in that, instead of using attention to weight hidden units of an encoder, it uses…
☆42Updated 4 years ago
twitter-research / hyperbolic-rl
☆55Updated 2 years ago
yfletberliac / adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
☆48Updated 3 years ago
MaxHalford / myriade
✨🌲 Hierarchical extreme multiclass and multi-label classification.
☆17Updated 2 years ago
bmazoure / ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆57Updated 2 years ago
awslabs / or-rl-benchmarks
The source code for the paper: 'ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems'
☆85Updated 4 years ago
xtma / simple-pytorch-rl
Reinforcement Learning Methods with PyTorch
☆39Updated 5 years ago
zach-lawless / gym-wordle
Gym environment for playing Wordle with RL agents
☆39Updated 3 years ago
henry-prior / jax-rl
JAX implementations of core Deep RL algorithms
☆82Updated 3 years ago
kaist-silab / equity-transformer
☆27Updated last year
linesd / tabular-methods
Tabular methods for reinforcement learning
☆38Updated 5 years ago
thaihungle / EPGT
Episodic Policy Gradient Training
☆14Updated 3 years ago