feidieufo/RL-Implementation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/feidieufo/RL-Implementation)

feidieufo / RL-Implementation

simple code to reinforcement learning

☆20

Alternatives and similar repositories for RL-Implementation

Users that are interested in RL-Implementation are comparing it to the libraries listed below

Sorting:

XinJingHao / Actor-Sharer-Learner
View on GitHub
Actor-Sharer-Learner training framework for off-policy DRL algorithms
☆22Dec 29, 2024Updated last year
ltricot / zerosum
View on GitHub
CFR implementation of a poker bot.
☆12Feb 17, 2023Updated 3 years ago
ShibiHe / Poker-Fictitious-Play
View on GitHub
Fictitious Self-play & Reinforcement Learning
☆18Jan 26, 2018Updated 8 years ago
proroklab / HetGPPO
View on GitHub
Heterogeneous Multi-Robot Reinforcement Learning
☆66Nov 10, 2025Updated 3 months ago
hijkzzz / reinforcement-learning-trading-robot
View on GitHub
Trading Robot based on LSTM-PPO
☆28Dec 27, 2019Updated 6 years ago
raharth / PyMatch
View on GitHub
A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms
☆13Dec 15, 2022Updated 3 years ago
grantsrb / PyTorch-A2C
View on GitHub
General implementation of Advantage Actor Critic using Pytorch
☆28Dec 7, 2021Updated 4 years ago
YiyangZYY / MARL_based_Optimization_for_Cell-Free_Massive_MIMO_Systems
View on GitHub
☆15May 20, 2025Updated 9 months ago
zhisbug / ray-scalable-ml-design
View on GitHub
Some microbenchmarks and design docs before commencement
☆12Feb 1, 2021Updated 5 years ago
MahmutAgrali / DDPG-DQN-PD-Controller-for-VTOL
View on GitHub
☆10Dec 10, 2021Updated 4 years ago
Kyziridis / BipedalWalker-v2
View on GitHub
Solving openAI's game 'BipedalWalker-v2' with Deep Reinforcement Learning
☆27May 26, 2020Updated 5 years ago
MichaelUnknown / mkpoker
View on GitHub
A Texas Holdem poker framework written in C++ 20.
☆11Apr 23, 2023Updated 2 years ago
Misaki-Akeno / minimind-v-vla
View on GitHub
🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM！🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
☆27Oct 16, 2025Updated 4 months ago
cxiang26 / Simple_GAN
View on GitHub
GAN: An example for generating Gaussian distribution by a simple generating adversarial network.
☆12Dec 28, 2020Updated 5 years ago
JayaniP / Multi_Agent-LLM
View on GitHub
Enhancing Multi-Agent System Coordination in Autonomous Electric Vehicles Using Large Language Models
☆20Dec 13, 2023Updated 2 years ago
sean1295 / DiffDAgger
View on GitHub
☆20Mar 10, 2025Updated 11 months ago
marcoschouten / planning-and-decision-making
View on GitHub
RO47005 Planning & Decision Making. Quadrotor model planner using probabilistic roadmap (PRM) and collision avoidance using Velocity Obst…
☆10Feb 28, 2022Updated 4 years ago
kerthcet / k8s-specific-knowledge-base
View on GitHub
A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.
☆10Sep 14, 2023Updated 2 years ago
wangrongding / folder-print
View on GitHub
🌿快速生成文件夹目录结构，支持定义目录层级，支持生成到 markdown 文件。
☆13Oct 19, 2022Updated 3 years ago
zhang-guangyi / HJSCC
View on GitHub
This is a pytorch implementation of our AAAI paper for learned image transmission with HVAE
☆10Aug 8, 2025Updated 6 months ago
jjgonde / Alicante-Murcia-SUMO-Scenario
View on GitHub
Calibrated Alicante-Murcia Freeway SUMO Scenario
☆11Nov 28, 2019Updated 6 years ago
AIandGlobalDevelopmentLab / eo-poverty-review
View on GitHub
Awesome papers on Earth Observation (EO), Machine Learning (ML), and Causal Inference (CI) [Edward Elgar Publishing]
☆11Jan 18, 2026Updated last month
CMU-TBD / Group_based_navigation_v1
View on GitHub
Code for paper "Group-based Motion Prediction for Navigation in Crowded Environments"
☆13Feb 19, 2025Updated last year
CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
View on GitHub
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆12Mar 9, 2018Updated 7 years ago
fanshiliang / Hierarchical-Deep-Reinforcement-Learning
View on GitHub
paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation
☆10Mar 27, 2018Updated 7 years ago
tpoisonooo / open-r1
View on GitHub
Fully open reproduction of DeepSeek-R1
☆12Mar 24, 2025Updated 11 months ago
transparent-framework / optimize-ride-sharing-earnings
View on GitHub
A GitHub repository associated with paper "Learn to Earn: Enabling Coordination Within a Ride-Hailing Fleet"
☆10Jun 22, 2020Updated 5 years ago
teguhSL / optimal_control_distribution
View on GitHub
Repository for computing the probability distribution of an optimal control problem
☆11Oct 4, 2021Updated 4 years ago
SageCao1125 / GPC
View on GitHub
[ICLR 2026] General Policy Composition (GPC)
☆30Jan 29, 2026Updated last month
TinaMenke / Deep-Reinforcement-Learning
View on GitHub
Deep Reinforcement Learning with continuous control in CARLA
☆11Dec 8, 2022Updated 3 years ago
Cothrax / deepfool
View on GitHub
CFR-based Texas Hold'em AI
☆11Jan 30, 2021Updated 5 years ago
Alekat13 / Deep-Reinforcement-Ant-Colony-Optimization-
View on GitHub
Swarm learning algorithm
☆11Jun 2, 2021Updated 4 years ago
hanizaidi110 / Opponent-Modeling-and-Predicting-Opponent-moves-in-Poker
View on GitHub
Advanced_Data_Integration_Project
☆11Jul 31, 2018Updated 7 years ago
worldveil / gopoker
View on GitHub
Poker hand evaluation for Go
☆12Feb 7, 2014Updated 12 years ago
ZhongZ-Wang / Model-Based-RL
View on GitHub
这是一个关于基于模型的强化学习的资料，包括一些代码地址、paper、slide等。
☆46Aug 22, 2020Updated 5 years ago
queenxy / DMLoco
View on GitHub
☆26Jul 14, 2025Updated 7 months ago
amap-cvlab / world-env
View on GitHub
☆24Oct 31, 2025Updated 4 months ago
HYLZ-2019 / EFS
View on GitHub
Code for the paper All-in-focus Imaging from Event Focal Stack, CVPR 2023.
☆13Oct 3, 2025Updated 5 months ago
clear-nus / ltldog
View on GitHub
☆13Dec 17, 2025Updated 2 months ago