Collections of powerful RL architectures with brief introductions.
☆13 Nov 20, 2020 Updated 5 years ago
Alternatives and similar repositories for Reinforcement-Learning-Platforms
Users who are interested in Reinforcement-Learning-Platforms are comparing it to the repositories listed below.
- Welcome to the Battle Simulator, a real-time strategy game where two armies clash on a battlefield. Customize your soldiers, manage resou… ☆14 Jan 17, 2026 Updated 2 months ago
- Battleship environment for reinforcement learning tasks ☆14 Apr 29, 2023 Updated 2 years ago
- A project using PyTorch to implement reinforcement learning on a simple game, Gridworld ☆13 Jul 13, 2020 Updated 5 years ago
- LLMs for Wargames ☆17 Sep 21, 2024 Updated last year
- Learning Backtracking Models, ICLR'19 ☆10 Feb 2, 2018 Updated 8 years ago
- Economics of Ransomware | Dataset ☆15 May 2, 2018 Updated 7 years ago
- Training and testing pipeline for ransomware classification based on screenshots of the splash screens or ransom notes (https://arxiv.org… ☆11 Jul 19, 2020 Updated 5 years ago
- A bimanual robotics platform combining LeRobot and ManiSkill for advanced dual-arm manipulation tasks using the SO100 robot digital twin. ☆37 Jun 26, 2025 Updated 8 months ago
- Prioritized Sequence Experience Replay ☆10 Aug 16, 2021 Updated 4 years ago
- Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning ☆13 Aug 12, 2021 Updated 4 years ago
- Version 3.0.0: PyTorch implementations of DQN, DDQN, DDPG, SAC, and Discrete SAC, with more features :) ☆12 Feb 16, 2023 Updated 3 years ago
- Nowadays, the use of machine learning methods in simulation systems has been gaining importance with the spread and growth of machine learning me… ☆25 Nov 4, 2025 Updated 4 months ago
- Man-in-the-middle attack demo ☆11 Jan 14, 2018 Updated 8 years ago
- 🦾 Set up your embodied LLM agent with the same ease as normal agents in CrewAI or Autogen ☆62 Updated this week
- [NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha… ☆10 Feb 13, 2022 Updated 4 years ago
- Factored Interactive POMDP solver based on symbolic Perseus. ☆11 Aug 12, 2025 Updated 7 months ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 Mar 5, 2021 Updated 5 years ago
- This repository is to develop novel AIs for complex C2 decision making. It consists of parallel branches for GUI and for AI development (… ☆31 Oct 12, 2022 Updated 3 years ago
- This repo is created to perform I/O Request Packet (IRP) driven ransomware analysis where the IRP logs were collected during ransomware e… ☆11 Aug 14, 2020 Updated 5 years ago
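- A collection of common hand-written coding exercises for reviewing ahead of algorithm-engineer interviews at internet companies: sorting algorithms, shortest-path algorithms, binary tree traversal algorithms, SQL statements, the NMS algorithm, the IoU algorithm, multi-head attention (MHA), and more ☆21 Mar 18, 2025 Updated last year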
- Sharing the codebase and steps for artifact evaluation for ISCA 2023 paper ☆15 Feb 20, 2024 Updated 2 years ago
- OpenAI Gym-style RL benchmark for interconnection network congestion control studies ☆17 May 12, 2022 Updated 3 years ago
- AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao. ☆13 Nov 8, 2021 Updated 4 years ago
- Paper collection on reinforcement learning exploration, covering exploration in multi-armed bandits, reinforcement learning, and multi-agent rein… ☆36 Nov 8, 2019 Updated 6 years ago
- TRPO implementation in TensorFlow 2.0 for Reinforcement Learning Project @ Sapienza ☆16 Mar 25, 2023 Updated 2 years ago
- A Linux/Windows Ransomware PoC written in Python, Go, and C ☆16 Jun 17, 2023 Updated 2 years ago
- An implementation of ATOC ☆14 Dec 6, 2021 Updated 4 years ago
- meta-MADDPG (Python implementation) ☆19 Sep 16, 2018 Updated 7 years ago
- Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research ☆38 Apr 11, 2024 Updated last year
- Decentralized deep multi-agent reinforcement learning in physical environments. ☆14 Aug 19, 2018 Updated 7 years ago
- A script for sniffing internet traffic between a machine and the gateway in your local network. ☆18 Dec 26, 2020 Updated 5 years ago
- Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning ☆20 Aug 12, 2021 Updated 4 years ago
- Code and Data for AsiaCCS 2018 paper: Hardware Performance Counters Can Detect Malware: Myth or Fact? ☆23 Feb 20, 2026 Updated 3 weeks ago
- Official implementation of ISSTA 2022 paper: MDPFuzz: Testing Models Solving Markov Decision Processes. ☆24 Dec 17, 2022 Updated 3 years ago