KaiYan289/RLpapersnote

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KaiYan289/RLpapersnote)

KaiYan289 / RLpapersnote

☆49

Alternatives and similar repositories for RLpapersnote

Users that are interested in RLpapersnote are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dev-singularity / Inventory-Optimization-Algorithms
View on GitHub
Algorithms Library for Supply Chain Inventory Optimization
☆19Feb 2, 2019Updated 7 years ago
uoe-agents / seps
View on GitHub
Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)
☆25Oct 26, 2021Updated 4 years ago
lapisrocks / DiscreteAdversarialDistillation
View on GitHub
[NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"
☆11Jun 18, 2024Updated 2 years ago
akazemipour / Distributional-RL
View on GitHub
Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.
☆26Jun 17, 2025Updated last year
lns / memoire
View on GitHub
☆18Apr 17, 2019Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
fshamshirdar / gym-airsim
View on GitHub
OpenAI Gym environment for AirSim
☆24Nov 27, 2019Updated 6 years ago
lamda-bbo / madac
View on GitHub
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆26Mar 6, 2023Updated 3 years ago
wingsweihua / presslight
View on GitHub
☆75May 5, 2023Updated 3 years ago
Suryavf / SelfDrivingCar
View on GitHub
☆13Jan 15, 2022Updated 4 years ago
LSTM-Kirigaya / Quadrotor
View on GitHub
使用DDPG算法解决rlschool中无人机悬停控制的问题（内含训练了9个小时的良模型）
☆10Jul 7, 2020Updated 6 years ago
IntologyAI / NanoGPT-Bench
View on GitHub
☆21Jul 3, 2026Updated 3 weeks ago
Vokturz / tsnmf-sparse
View on GitHub
Topic supervised non-negative matrix factorization with sparse matrices
☆12Mar 24, 2020Updated 6 years ago
qlan3 / Jaxplorer
View on GitHub
Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.
☆13Jul 19, 2024Updated 2 years ago
StanfordASL / RSIRL
View on GitHub
Risk-sensitive Inverse Reinforcement Learning
☆11Sep 11, 2019Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sungsulim / RLControl
View on GitHub
Implementation of Continuous Control RL Algorithms
☆11Dec 8, 2022Updated 3 years ago
SafeRL-Lab / Robust-RL-Baselines
View on GitHub
Robust Reinforcement Learning Benchmark
☆13Sep 22, 2024Updated last year
alialaradi / PairsTrading
View on GitHub
MATLAB code to produce results and figures in the paper "Stochastic Optimal Control of Pairs Trading Strategies with Absolute and Relativ…
☆15Jun 1, 2018Updated 8 years ago
Mythobeast / epidemicmodels
View on GitHub
SIR, SEIR, and beyond
☆10Jul 6, 2023Updated 3 years ago
facebookresearch / td-delta
View on GitHub
Separating value functions across time-scales.
☆18May 13, 2019Updated 7 years ago
TomZahavy / GrayingTheBox
View on GitHub
Code implementation of: "Graying the black box: Understanding DQNs"
☆20Feb 23, 2017Updated 9 years ago
rasalkumar / PowerElectronics
View on GitHub
DC Motor Tuning Using Fuzzy Logic And PID Controller
☆12Oct 8, 2020Updated 5 years ago
waterhorse1 / NAC
View on GitHub
(NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.
☆28Nov 19, 2021Updated 4 years ago
Amin-Razzaghi / Adaptive-Cruise-Control-based-on-MPC
View on GitHub
Simulation and Robotic Implementation of the Adaptive Cruise Control based on the Predictive Control Model
☆13Jan 12, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
m-hasan-n / roundabout
View on GitHub
Code of our work "Maneuver-based Anchor Trajectory Hypotheses at Roundabouts".
☆14Sep 15, 2022Updated 3 years ago
morning9393 / Optimal-Baseline-for-Multi-agent-Policy-Gradients
View on GitHub
☆30Aug 20, 2021Updated 4 years ago
kevinhongzl / awesome-operations-analytics
View on GitHub
A curated list of awesome video lectures and learning resources for operations analytics.
☆33Aug 23, 2021Updated 4 years ago
PranjayGoyal / Warehouse-Robots-Path-Planning
View on GitHub
Designing an optimized path for multiple robots in a warehouse for picking and delivery operations using A* algorithm (shortest path) and…
☆11Jul 28, 2023Updated 3 years ago
StepNeverStop / RLs
View on GitHub
Reinforcement Learning Algorithms Based on PyTorch
☆453Oct 21, 2021Updated 4 years ago
AleksandarHaber / Simulation-of-State-Space-Models-of-Dynamical-Systems-in-Cpp--Eigen-Matrix-Library-Tutorial
View on GitHub
☆16Apr 10, 2026Updated 3 months ago
lrhammond / almanac
View on GitHub
Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…
☆10May 5, 2022Updated 4 years ago
xubo92 / robotics-navigation-deep-reinforcement-learning
View on GitHub
Autonomous vehicle learn how to navigate efficiently at crossroad
☆16Jan 31, 2018Updated 8 years ago
atmughrabi / OpenGraph
View on GitHub
OpenGraph is an open-source graph processing benchmarking suite written in pure C/OpenMP.
☆14Apr 27, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
himeinhardt / MatTuGames
View on GitHub
A Matlab Toolbox for Cooperative Game Theory
☆47Feb 15, 2025Updated last year
nik7273 / covid-pgmorl
View on GitHub
Multi-objective reinforcement learning for covid-19 control
☆12Aug 12, 2021Updated 4 years ago
anonymous1234517 / code
View on GitHub
☆11Sep 22, 2019Updated 6 years ago
cwj22 / BeT-AIL
View on GitHub
☆13Mar 18, 2024Updated 2 years ago
SahanaRamnath / MultiArmedBandit_RL
View on GitHub
Implementation of various multi-armed bandits algorithms on a 10-arm testbed.
☆38Jan 16, 2020Updated 6 years ago
KAIST-AILab / gmmil
View on GitHub
Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"
☆11Oct 2, 2018Updated 7 years ago
john-hewitt / truncation-sampling
View on GitHub
Codebase describing experiments in Truncation Sampling as Language Model Desmoothing
☆13Dec 6, 2022Updated 3 years ago