Attention mechanism incorporated into Asynchronous Advantage Actor-Critic (A3C/A2C, DeepMind)
☆10 · Jan 9, 2018 · Updated 8 years ago
Alternatives and similar repositories for AttentionTRL
Users that are interested in AttentionTRL are comparing it to the libraries listed below
- Comp 781 Project ☆10 · Jan 2, 2026 · Updated 2 months ago
- The implementation of Discriminator Soft Actor Critic ☆15 · Jan 25, 2020 · Updated 6 years ago
- Constrained Exploration and Recovery from Experience Shaping ☆22 · Apr 18, 2019 · Updated 6 years ago
- Trading Stock with Deep Reinforcement Learning ☆24 · Aug 20, 2018 · Updated 7 years ago
- Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning ☆29 · Nov 27, 2019 · Updated 6 years ago
- A simple Gridworld environment for OpenAI Gym ☆25 · Jun 10, 2018 · Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks ☆23 · Jan 10, 2019 · Updated 7 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆32 · Sep 17, 2018 · Updated 7 years ago
- ☆32 · Jun 25, 2018 · Updated 7 years ago
- Python implementation of Association Rule Mining ☆11 · Apr 26, 2024 · Updated last year
- A gym game for Contra, for reinforcement learning ☆10 · Oct 18, 2021 · Updated 4 years ago
- Portfolio optimization and index tracking for the FTSE index using genetic algorithm ☆12 · Jan 13, 2018 · Updated 8 years ago
- ☆11 · Mar 31, 2024 · Updated last year
- ☆12 · Jan 11, 2020 · Updated 6 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr… ☆38 · Feb 5, 2019 · Updated 7 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆44 · Jun 14, 2021 · Updated 4 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆168 · Mar 28, 2021 · Updated 4 years ago
- Meta Reinforcement Learning Experiments ☆35 · Aug 22, 2017 · Updated 8 years ago
- Reinforcement Learning, Tutorials in Chinese ☆11 · Jun 9, 2018 · Updated 7 years ago
- MATLAB implementation of the universal directed information estimators in Jiantao Jiao, Haim H. Permuter, Lei Zhao, Young-Han Kim, and Ts… ☆11 · Apr 2, 2019 · Updated 6 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits ☆10 · Oct 21, 2024 · Updated last year
- Recently, image classification has drawn the attention of many researchers. The need for object recognition is growing drastically, especially in the … ☆11 · May 14, 2017 · Updated 8 years ago
- GPU RKF45 (Runge–Kutta–Fehlberg) ODE solver written in pycuda/python3 ☆10 · Mar 22, 2019 · Updated 6 years ago
- JAX implementation of GPTQ quantization algorithm ☆10 · Jul 19, 2023 · Updated 2 years ago
- We have a Turtlebot simulator which is treated as an autonomous vehicle. Global path planning is applied on this map environment. A webca… ☆11 · Dec 9, 2017 · Updated 8 years ago
- JAX/Haiku implementation of "Auction Learning as a Two-Player Game" ☆11 · Jul 6, 2024 · Updated last year
- My marketing projects applying AI ☆11 · Oct 31, 2025 · Updated 4 months ago
- This is the issue log for the Roam-Excalidraw plugin. ☆10 · Mar 15, 2021 · Updated 4 years ago
- Multi-Objective Causal Bayesian Optimisation, a new paradigm for finding Pareto-optimal interventions in multi-outcome causal models ☆16 · Jun 2, 2025 · Updated 9 months ago
- ☆12 · Mar 6, 2020 · Updated 6 years ago
- ☆10 · Nov 5, 2025 · Updated 4 months ago
- Portfolio Optimisation is a fundamental problem in Financial Mathematics. The objective of this project is to explore the applicability of… ☆13 · Nov 10, 2020 · Updated 5 years ago
- ☆10 · Sep 20, 2018 · Updated 7 years ago
- Super-Paramagnetic Clustering, Maximum entropy, Maximum Likelihood Methods. ☆11 · Oct 18, 2021 · Updated 4 years ago
- 🎮 A configurable Breakout environment for reinforcement learning ☆11 · Mar 20, 2018 · Updated 7 years ago
- Layered distributions using FLAX/JAX ☆10 · Dec 13, 2020 · Updated 5 years ago
- Matlab Implementation of "The Hierarchical Hidden Markov Model: Analysis and Applications" ☆22 · Aug 31, 2011 · Updated 14 years ago
- A library to create lore plots (logistic regression of the prevalence of a categorical variable as a function of a continuous feature) ☆18 · Mar 1, 2026 · Updated last week
- GridSim is an autonomous driving simulator engine that uses a car-like robot architecture to generate occupancy grids from simulated sens… ☆11 · Mar 24, 2023 · Updated 2 years ago