simple code to reinforcement learning
☆20Aug 30, 2020Updated 5 years ago
Alternatives and similar repositories for RL-Implementation
Users that are interested in RL-Implementation are comparing it to the libraries listed below
Sorting:
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning☆18Jan 26, 2018Updated 8 years ago
- Heterogeneous Multi-Robot Reinforcement Learning☆66Nov 10, 2025Updated 3 months ago
- Trading Robot based on LSTM-PPO☆28Dec 27, 2019Updated 6 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- General implementation of Advantage Actor Critic using Pytorch☆28Dec 7, 2021Updated 4 years ago
- ☆15May 20, 2025Updated 9 months ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- Solving openAI's game 'BipedalWalker-v2' with Deep Reinforcement Learning☆27May 26, 2020Updated 5 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆27Oct 16, 2025Updated 4 months ago
- GAN: An example for generating Gaussian distribution by a simple generating adversarial network.☆12Dec 28, 2020Updated 5 years ago
- Enhancing Multi-Agent System Coordination in Autonomous Electric Vehicles Using Large Language Models☆20Dec 13, 2023Updated 2 years ago
- ☆20Mar 10, 2025Updated 11 months ago
- RO47005 Planning & Decision Making. Quadrotor model planner using probabilistic roadmap (PRM) and collision avoidance using Velocity Obst…☆10Feb 28, 2022Updated 4 years ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- This is a pytorch implementation of our AAAI paper for learned image transmission with HVAE☆10Aug 8, 2025Updated 6 months ago
- Calibrated Alicante-Murcia Freeway SUMO Scenario☆11Nov 28, 2019Updated 6 years ago
- Awesome papers on Earth Observation (EO), Machine Learning (ML), and Causal Inference (CI) [Edward Elgar Publishing]☆11Jan 18, 2026Updated last month
- Code for paper "Group-based Motion Prediction for Navigation in Crowded Environments"☆13Feb 19, 2025Updated last year
- We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting☆12Mar 9, 2018Updated 7 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 7 years ago
- Fully open reproduction of DeepSeek-R1☆12Mar 24, 2025Updated 11 months ago
- A GitHub repository associated with paper "Learn to Earn: Enabling Coordination Within a Ride-Hailing Fleet"☆10Jun 22, 2020Updated 5 years ago
- Repository for computing the probability distribution of an optimal control problem☆11Oct 4, 2021Updated 4 years ago
- [ICLR 2026] General Policy Composition (GPC)☆30Jan 29, 2026Updated last month
- Deep Reinforcement Learning with continuous control in CARLA☆11Dec 8, 2022Updated 3 years ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- Swarm learning algorithm☆11Jun 2, 2021Updated 4 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- Poker hand evaluation for Go☆12Feb 7, 2014Updated 12 years ago
- 这是一个关于基于模型的强化学习的资料,包括一些代码地址、paper、slide等。☆46Aug 22, 2020Updated 5 years ago
- ☆26Jul 14, 2025Updated 7 months ago
- ☆24Oct 31, 2025Updated 4 months ago
- Code for the paper All-in-focus Imaging from Event Focal Stack, CVPR 2023.☆13Oct 3, 2025Updated 5 months ago
- ☆13Dec 17, 2025Updated 2 months ago