强化学习经典算法(offline\online learning, q-learning, DQN)的实现在平衡杆游戏和几个Atari 游戏 (CartPole\Pong\Boxing\MsPacman)
☆32Aug 8, 2018Updated 7 years ago
Alternatives and similar repositories for Reinforcment-Leanring-algorithm
Users that are interested in Reinforcment-Leanring-algorithm are comparing it to the libraries listed below
Sorting:
- ☆14Jul 4, 2022Updated 3 years ago
- Fork of Microsoft/LightGBM to include support for the CEGB (Cost Efficient Gradient Boosting) algorithm. Original repository at https://g…☆13Jun 30, 2017Updated 8 years ago
- ☆12Feb 20, 2021Updated 5 years ago
- Simulation Study of Double Threshold Energy Detection Method for Cognitive Radios☆14Aug 11, 2018Updated 7 years ago
- ☆14Dec 4, 2018Updated 7 years ago
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- 学习强化学习过程中的笔记和代码☆12Jul 27, 2020Updated 5 years ago
- Code for ICRA 2018 paper - Interactive Robot Knowledge Patching using Augmented Reality☆14Aug 22, 2018Updated 7 years ago
- ☆14Aug 26, 2018Updated 7 years ago
- MSc Informatics dissertation project - University of Edinburgh: Curiosity in Multi-Agent Reinforcement Learning☆13Aug 16, 2019Updated 6 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆13Apr 26, 2019Updated 6 years ago
- 算法工程师技术栈学习笔记☆15Aug 22, 2022Updated 3 years ago
- Code for CVPR22 - Motron: Multimodal Probabilistic Human Motion Forecasting☆18Jun 4, 2024Updated last year
- A safe and efficient autonomous driving algorithm. Winner of the 2019 DriveML Huawei Autonomous Vehicles Challenge. Built using RLLib and…☆18Jan 24, 2020Updated 6 years ago
- Fast python library encapsulating the nfqueue netlink interface.☆18Sep 3, 2024Updated last year
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 3 years ago
- [MIG2021] Deep Reinforcement Learning with Particle Filtering Policy Network for Physics-Based Character Control☆17Feb 25, 2022Updated 4 years ago
- ☆14Sep 1, 2021Updated 4 years ago
- The visualization of a multi-agent reinforcement learning (MARL)-based strategy with efficient exploration strategy.☆20Oct 28, 2022Updated 3 years ago
- UCI chess playing engine derived from Stockfish and LeelaChess Zero☆18Aug 19, 2018Updated 7 years ago
- SIA - C++/Python library for model-based stochastic estimation and optimal control☆23Apr 3, 2024Updated last year
- ☆20Jun 7, 2020Updated 5 years ago
- I2Q: A Fully Decentralized Q-Learning Algorithm☆19Nov 10, 2022Updated 3 years ago
- An implementation of the traffic simulation optimisation with reinforcement learning, with FLOW and SUMO.☆17Jan 15, 2021Updated 5 years ago
- ☆18Jun 26, 2018Updated 7 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))☆14Mar 22, 2019Updated 6 years ago
- A modular implementation for Proximal Policy Optimization in Tensorflow 2 using Eagerly Execution for the Super Mario Bros enviroment.☆21Nov 6, 2019Updated 6 years ago
- A reinforcement learning based behaviour planner for autonomous driving agents☆20Jan 8, 2021Updated 5 years ago
- ☆23Aug 26, 2024Updated last year
- Official code for paper Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation☆21Feb 29, 2024Updated 2 years ago
- Code to replicate the Representation Noising paper and tools for evaluating defences against harmful fine-tuning☆23Dec 12, 2024Updated last year
- OpenLock Environment for OpenAI Gym☆19Feb 16, 2021Updated 5 years ago
- This contains joint channel and power allocation scheme for a full duplex cognitive radio network underlying a cellular network☆25Oct 20, 2017Updated 8 years ago
- This is a part of MATLAB implementation of the paper "Machine Learning Techniques for Cooperative Spectrum Sensing in Cognitive Radio Net…☆24Oct 1, 2020Updated 5 years ago
- 基于深度强化学习DQN的FlappyBird游戏AI开发☆15Aug 12, 2019Updated 6 years ago
- From Llama to Deepseek, grpo/mtp implemented. With pt/sft/lora/qlora included☆30Apr 21, 2025Updated 10 months ago
- MATLAB files of modulation classification in cognitive radios☆24Jul 12, 2016Updated 9 years ago
- A deep reinforcement learning based approach is used to allocate downlink power for multi-cell wireless system.☆25Feb 21, 2020Updated 6 years ago
- Pytorch implementation of state-of-the-art offline reinforcement learning algorithms.☆23Aug 27, 2022Updated 3 years ago