Experiments with transformer-based RL algorithms
☆22Nov 23, 2019Updated 6 years ago
Alternatives and similar repositories for Transformer-RL
Users who are interested in Transformer-RL are comparing it to the libraries listed below.
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 5 years ago
- Adaptive Attention Span for Reinforcement Learning☆136May 11, 2020Updated 6 years ago
- ☆23Dec 25, 2024Updated last year
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Feb 21, 2023Updated 3 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))☆14Mar 22, 2019Updated 7 years ago
- An implementation of the traffic simulation optimisation with reinforcement learning, with FLOW and SUMO.☆17Jan 15, 2021Updated 5 years ago
- OpenAI Gym Environment for Low-Latency Trading☆20Jun 15, 2018Updated 8 years ago
- A bipedal humanoid control system using a Physics-Informed Neural Network (PINN) and Reinforcement Learning (RL) for stability and manipu…☆13Mar 25, 2026Updated 3 months ago
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment☆11Oct 20, 2018Updated 7 years ago
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 5 years ago
- Carla Multi Agent Deep Reinforcement Learning☆22Nov 27, 2020Updated 5 years ago
- ☆32Dec 1, 2019Updated 6 years ago
- Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)☆10Jun 6, 2023Updated 3 years ago
- This repository contains Python functions for predicting time series.☆15May 24, 2024Updated 2 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- [Neurocomputing, 2023] Personalized Robotic Control via Constrained Multi-Objective Reinforcement Learning☆28Dec 25, 2023Updated 2 years ago
- NGSIM Driving RL/Imitation learning environment compatible with rllab☆13Feb 23, 2018Updated 8 years ago
- Learning Safety Constraints for Large Language Models (ICML2025)☆35May 25, 2026Updated last month
- Transformer-based Multi-Agent Actor-Critic Framework☆46Jun 8, 2022Updated 4 years ago
- This is the unofficial implementation of LEMON (ICLR'2024).☆13Apr 14, 2024Updated 2 years ago
- A hobby implementation of an ncurses binding for Idris 2☆17Dec 9, 2024Updated last year
- PyTrafficSim is a light traffic simulator for research related purposes. PTS is the most easy way to test your self-driving algorithm wit…☆20Mar 6, 2021Updated 5 years ago
- Implementation of "Actor-Critic Alignment for Offline-to-Online Reinforcement Learning"☆13Oct 12, 2023Updated 2 years ago
- The CLI & python API for the well-known project gpt-academic.☆19Sep 22, 2024Updated last year
- On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignment)☆11Nov 20, 2017Updated 8 years ago
- Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)☆48Nov 30, 2018Updated 7 years ago
- SemiDefinite Programming Algorithm (SDPA) for Python☆12Jan 27, 2025Updated last year
- This repository contains PyTorch implementations of reinforcement learning algorithms. Its purpose is to provide straightforward and easi…☆19Nov 10, 2023Updated 2 years ago
- ResNet implementation in Julia☆12Feb 5, 2022Updated 4 years ago
- The model for edge classification by transforming edges to nodes.☆15Dec 22, 2020Updated 5 years ago
- ☆16Apr 28, 2023Updated 3 years ago
- A data processing module implemented with numpy☆10Aug 16, 2022Updated 3 years ago
- ☆50Jul 30, 2023Updated 2 years ago
- Lyrics for Spotify Android☆10Jun 17, 2019Updated 7 years ago
- ☆16Oct 14, 2021Updated 4 years ago
- Research notes on using Cython to get asynchronous execution beyond IO tasks☆11Feb 17, 2020Updated 6 years ago
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year
- how to build a sentence embedding application using BentoML☆15Jun 10, 2026Updated 3 weeks ago
- Demo for the subjective interface☆14Mar 4, 2018Updated 8 years ago