I used this paper as inspiration https://arxiv.org/pdf/1904.03367.pdf
☆36Mar 10, 2026Updated 2 months ago
Alternatives and similar repositories for self-attention-ppo-pytorch
Users that are interested in self-attention-ppo-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Daily Paper Reading☆24Jan 17, 2026Updated 4 months ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- ☆24Oct 14, 2023Updated 2 years ago
- Accepted by AROB 2021. A car-agent navigates in complex traffic conditions by Mixed_Input_PPO_CNN_LSTM model.☆14May 22, 2021Updated 5 years ago
- Deep Implicit Coordination Graphs☆45May 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Communication-efficient MARL for CACC☆27Aug 7, 2023Updated 2 years ago
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- 使用PPO算法+OU噪声进行机械臂轨迹规划仿真☆18May 10, 2024Updated 2 years ago
- The Paradox of Choice: Using Attention in Hierarchical Reinforcement Learning☆11Oct 31, 2021Updated 4 years ago
- Code for IEEE transactions on neural networks and learning system☆13Jun 18, 2021Updated 4 years ago
- implementation of MADDPG using PyTorch and multiagent-particle-envs☆39May 11, 2022Updated 4 years ago
- Implementation of SNAIL(A Simple Neural Attentive Meta-Learner) with Gluon☆12Feb 22, 2019Updated 7 years ago
- Tensorflow implementation of SNAIL and RL2☆11Aug 17, 2019Updated 6 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆12Jul 29, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆19Dec 30, 2023Updated 2 years ago
- ☆11Aug 13, 2020Updated 5 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆30Jul 18, 2024Updated last year
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- MRC-LSTM: A Hybrid Approach of Multi-scale Residual CNN and LSTM to Predict Bitcoin Price☆11Jun 13, 2022Updated 3 years ago
- PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"☆10Nov 22, 2019Updated 6 years ago
- The dataset contains the vehicle trajectory data perceived by the roadside perception system deployed at the signalized intersections and…☆20Jan 12, 2024Updated 2 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Real time image capture+DQN path planning☆12May 29, 2023Updated 3 years ago
- an implementation of ATOC☆14Dec 6, 2021Updated 4 years ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ…☆32Jan 19, 2023Updated 3 years ago
- A simple RNN meta-learner☆10Dec 17, 2018Updated 7 years ago
- Multi-agent Deep Reinforcement Learning for Efficient Computation Offloading in Mobile Edge Computing☆14Jun 7, 2023Updated 2 years ago
- This project uses LSTM and Convolutional time series models to predict and forecast Google and Alibaba cluster traces☆10Dec 4, 2020Updated 5 years ago
- This repo contains the implementation of deep reinforcement learning (DRL) algorithms for virtual machine rescheduling in data centers.☆12Dec 2, 2022Updated 3 years ago
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆16Jul 17, 2021Updated 4 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆45Dec 31, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- A framework that exploits the potentials of distributed federated learning and double deep Q-networks to minimize joint energy and delay …☆11Apr 21, 2021Updated 5 years ago
- PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)☆16Oct 7, 2020Updated 5 years ago
- The source code of paper "Decentralized Neighbouring Information Fusion for Traffic Network Signal Control" and related baselines.☆22Apr 30, 2024Updated 2 years ago
- Autonomous driving agent in Carla simulator leveraging IL and RL techniques.☆28Dec 31, 2024Updated last year
- Source code to the AAAI21 publication Augmenting Policy Learning with Routines Discovered from a Single Demonstration☆17Jan 7, 2021Updated 5 years ago
- ☆11Apr 21, 2022Updated 4 years ago