godmoves/reinforcement_learning_collections

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/godmoves/reinforcement_learning_collections)

godmoves / reinforcement_learning_collections

A collection of deep reinforcement learning algorithm implementations

☆11

Alternatives and similar repositories for reinforcement_learning_collections

Users that are interested in reinforcement_learning_collections are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sail-sg / rosmo
View on GitHub
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
☆30Jul 18, 2023Updated 3 years ago
stetelepta / setdetection
View on GitHub
Detection valid SET combinations from images with SET-cards
☆12Dec 8, 2022Updated 3 years ago
robintyh1 / neurips2021-meta-gradient-offpolicy-evaluation
View on GitHub
Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
☆13Nov 3, 2021Updated 4 years ago
RaghadAlshaikh / Automatic-Arabic-Text-Summarizer
View on GitHub
Automatic Arabic Text Summarization using Python
☆12Jul 2, 2020Updated 6 years ago
labrijisaad / Youtube-video-transcriptor
View on GitHub
In this notebook, I implemented a script to transcribe YouTube videos (and audio files in general) using Google's speech-to-text API.
☆17Dec 19, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HebiRobotics / hebi-python-examples
View on GitHub
Examples for the HEBI Robotics Python API
☆14Jul 13, 2026Updated 2 weeks ago
ginop / reconchess-strangefish
View on GitHub
☆10Jan 24, 2022Updated 4 years ago
JuliaPOMDP / FactoredValueMCTS.jl
View on GitHub
Scalable MCTS for team scenarios
☆17Jun 14, 2024Updated 2 years ago
Juxiann / KuhnPoker
View on GitHub
An implementation of CFR algorithm to solve Kuhn Poker.
☆14Feb 6, 2020Updated 6 years ago
MUmarJaved / MultiAgent-Distributed-Reinforcement-Learning
View on GitHub
☆20Sep 14, 2019Updated 6 years ago
plter / LuaLessons20131126
View on GitHub
☆16Nov 26, 2013Updated 12 years ago
JBLanier / stratego_env
View on GitHub
Multi-Agent RL Environment for the Stratego Board Game (and variants)
☆34Jun 30, 2026Updated 3 weeks ago
madvn / DDPG
View on GitHub
Deep Deterministic Policy Gradients in TF r2.0
☆13Feb 6, 2020Updated 6 years ago
waterhorse1 / NAC
View on GitHub
(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.
☆28Nov 19, 2021Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
chagmgang / pysc2_rl
View on GitHub
☆10Jul 14, 2018Updated 8 years ago
jiayu-ch15 / Variational-Automatic-Curriculum-Learning
View on GitHub
curriculum
☆27Feb 7, 2023Updated 3 years ago
tilarids / reinforcement_learning_playground
View on GitHub
Playground for reinforcement learning algorithms implemented in TensorFlow
☆16Oct 18, 2016Updated 9 years ago
malayandi / Tiger-Problem-POMDP
View on GitHub
Implementation of POMDP algorithms on the tiger example, as described in Littman, Cassandra and Kaelbling (1994).
☆17Aug 8, 2017Updated 8 years ago
reconnaissanceblindchess / reconchess
View on GitHub
ReconChess python implementation
☆41Feb 17, 2022Updated 4 years ago
facebookresearch / diplomacy_searchbot
View on GitHub
Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021
☆52Aug 27, 2022Updated 3 years ago
BotPlayers / BotPlayers
View on GitHub
Play with agents and more.
☆22Sep 18, 2023Updated 2 years ago
martinballa / PyTAG
View on GitHub
☆26Jul 21, 2026Updated last week
PickNikRobotics / ros_reflexxes
View on GitHub
Reflexxes Type II provides acceleration-limited trajectory smoothing
☆11Feb 3, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
LorenzoBalandi / Optimal-Control-of-a-Robotic-Manipulator
View on GitHub
Course project: optimal control of a 2 dof manipulator exploiting the DDP algorithm
☆13Mar 4, 2022Updated 4 years ago
warehouse-picking-automation-challenges / team_pfn
View on GitHub
☆10Aug 25, 2016Updated 9 years ago
shaman-ai / llambdao
View on GitHub
Large Language Agents Modulating Behaviour in Decentralized Autonomous Organizations
☆24Jul 14, 2023Updated 3 years ago
sloretz / gazebo_model_path_example
View on GitHub
Example using package.xml to set gazebo model paths
☆12Sep 30, 2018Updated 7 years ago
xiaojudou / ICML2017_arXiv
View on GitHub
ICML 2017 accepted papers on arXiv.org
☆17May 25, 2017Updated 9 years ago
ddugovic / BayesianElo
View on GitHub
Bayesian Elo Rating estimator
☆15Jul 2, 2015Updated 11 years ago
paulcjh / gpt-j-6b
View on GitHub
☆50Jan 4, 2023Updated 3 years ago
starry-sky6688 / DyMA-CL
View on GitHub
Implementation of DyMA-CL, MARL algorithm
☆30Apr 18, 2020Updated 6 years ago
xihuai18 / A2PO-ICLR2023
View on GitHub
Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)
☆32Nov 22, 2025Updated 8 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ctmakro / canton
View on GitHub
yet another DL framework
☆11Oct 28, 2018Updated 7 years ago
camel-ai / camel_chat
View on GitHub
💬 Minimalistic repository to reproduce and serve CAMEL models.
☆24Jun 26, 2023Updated 3 years ago
google-deepmind / diplomacy
View on GitHub
☆60Apr 22, 2024Updated 2 years ago
liubenyuan / pyBSBL
View on GitHub
The python implementation of the BSBL algorithms for block sparse signal recovery
☆17Feb 17, 2022Updated 4 years ago
VDT-2023 / VDT
View on GitHub
☆10May 24, 2023Updated 3 years ago
wbernoudy / pygarble
View on GitHub
Garbled circuits in Python
☆25Jun 1, 2017Updated 9 years ago
google-deepmind / lm_act
View on GitHub
LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations
☆30May 21, 2025Updated last year