xtma/simple-pytorch-rl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xtma/simple-pytorch-rl)

xtma / simple-pytorch-rl

Reinforcement Learning Methods with PyTorch

☆38

Alternatives and similar repositories for simple-pytorch-rl

Users that are interested in simple-pytorch-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apache / incubator-retired-edgent-samples
View on GitHub
Mirror of Apache Edgent (Incubating) Samples
☆15Feb 14, 2018Updated 8 years ago
roger-creus / Wave-Defense-Learning-Environment
View on GitHub
A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.
☆14Jan 3, 2023Updated 3 years ago
britig / Hierarchical-Program-Triggered-RL
View on GitHub
This folder contains the experiments and code for Hierarchical Program Triggered RL paper
☆14Jan 7, 2022Updated 4 years ago
amitp-ai / Deep_Reinforcement_Learning
View on GitHub
Udacity's Deep Reinforcement Learning Nano-Degree
☆17Feb 8, 2021Updated 5 years ago
ben-eysenbach / info_geometry
View on GitHub
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20Oct 6, 2021Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
shanest / replicator_dynamic_examples
View on GitHub
Simple code for running and visualizing replicator dynamics
☆11Jan 31, 2024Updated 2 years ago
XinJingHao / Actor-Sharer-Learner
View on GitHub
Actor-Sharer-Learner training framework for off-policy DRL algorithms
☆22Dec 29, 2024Updated last year
ajgupta93 / d4pg-pytorch
View on GitHub
In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.
☆19Jun 15, 2018Updated 8 years ago
pdvelez / ml_soccer
View on GitHub
Soccer toy example simulator used in Reinforcement Learning
☆12Mar 11, 2018Updated 8 years ago
franksh / EpiCommute
View on GitHub
Simulate an epidemic metapopulation model with mobility-reducing containment strategies
☆11Aug 27, 2020Updated 5 years ago
udacity / MLND_CN_P5_Reinforcement_Learning
View on GitHub
nd009-cn-advanced-p5，针对Udacity CN MLND P5项目
☆14Jun 27, 2022Updated 4 years ago
xiyanxiongnico / AMPGen
View on GitHub
☆13May 18, 2025Updated last year
katyayn / Particle-Swarm-Optimization-for-Job-Shop-Scheduling
View on GitHub
Particle Swarm Optimization for Combinatorial Job Shop Scheduling Problem
☆12Dec 13, 2018Updated 7 years ago
plibin / epi-rl
View on GitHub
☆14Jun 21, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
UNITES-Lab / AgentSymbiotic
View on GitHub
☆14Mar 11, 2025Updated last year
cassandra / pomdp-solve
View on GitHub
Software for performing value iteration on partially observable Markov decision processes (POMDPs).
☆17Feb 2, 2024Updated 2 years ago
colinlee0924 / dqn-agv-dispatching
View on GitHub
☆13Aug 17, 2020Updated 5 years ago
jim-meyer / lottery_ticket_pruner
View on GitHub
(Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning
☆10Dec 8, 2022Updated 3 years ago
NicksSimulationsROS / multi_jackal
View on GitHub
ROS packages for simulating multiple Jackals in Gazebo
☆16Apr 25, 2018Updated 8 years ago
deepomicslab / Smart5UTR
View on GitHub
MTAE-based 5UTR design model
☆10Jan 23, 2024Updated 2 years ago
acumos / documentation
View on GitHub
☆18Jun 10, 2022Updated 4 years ago
Rohan138 / marl-baselines3
View on GitHub
Multi-Agent Reinforcement Learning with Stable-Baselines3
☆20Dec 3, 2021Updated 4 years ago
kamperh / globalphone_awe
View on GitHub
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11Nov 3, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hchkaiban / CarRacingRL_DDQN
View on GitHub
Reinforcement Learning for Gym CarRacing-v0 game (Double Deep Q Network)
☆19Apr 11, 2018Updated 8 years ago
JuliaReinforcementLearning / ReinforcementLearningBase.jl-Archive
View on GitHub
☆25May 6, 2021Updated 5 years ago
KJha02 / crossEnvCooperation
View on GitHub
Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"
☆22Sep 12, 2025Updated 10 months ago
metekemertas / RobustBisimulation
View on GitHub
Learning bisimulation metrics for control, particularly suited to sparse reward settings
☆11Feb 28, 2023Updated 3 years ago
alec-tschantz / planet
View on GitHub
PlaNet: Learning Latent Dynamics for Planning from Pixels
☆10Feb 13, 2020Updated 6 years ago
annieyan / Bandits-using-UCB-algorithm
View on GitHub
Thompson Sampling for Bandits using UCB policy
☆10Jul 29, 2017Updated 8 years ago
mlpc-ucsd / XTRA
View on GitHub
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
☆16Apr 30, 2023Updated 3 years ago
LucasAlegre / sac-plus
View on GitHub
Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).
☆15Feb 21, 2021Updated 5 years ago
zaixizhang / MolCode
View on GitHub
Chemical Science 2023: An equivariant generative framework for molecular graph-structure Co-design
☆10Jun 18, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
haotian-liu / transformers_llava
View on GitHub
☆16Apr 28, 2023Updated 3 years ago
Mostafa-Samir / 2048-RL-DRQN
View on GitHub
An attempt at applying Deep RL on the board game 2048
☆17Jan 5, 2017Updated 9 years ago
tqjxlm / Simple-DQN-Pytorch
View on GitHub
A simplistic implementation of DQN that works under CartPole-v0 with rendered pixels as input
☆13Feb 28, 2019Updated 7 years ago
AI-secure / Characterizing-Audio-Adversarial-Examples-using-Temporal-Dependency
View on GitHub
ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency".
☆11Apr 3, 2019Updated 7 years ago
lucko515 / Speech-commands-recognition
View on GitHub
Recognizing common speech commands using Keras and Tensorflow.
☆10Dec 17, 2018Updated 7 years ago
bentoml / BentoSentenceTransformers
View on GitHub
how to build a sentence embedding application using BentoML
☆15Jul 14, 2026Updated last week
JuliaReinforcementLearning / ReinforcementLearningCore.jl
View on GitHub
☆26May 6, 2021Updated 5 years ago