Multi-wavelength/Introduction-to-Reinforcement-Learning-with-Examples-and-Codes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Multi-wavelength/Introduction-to-Reinforcement-Learning-with-Examples-and-Codes)

Multi-wavelength / Introduction-to-Reinforcement-Learning-with-Examples-and-Codes

Examples and codes for the RL book

☆12

Alternatives and similar repositories for Introduction-to-Reinforcement-Learning-with-Examples-and-Codes

Users that are interested in Introduction-to-Reinforcement-Learning-with-Examples-and-Codes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

documentdb / booking-agents-sample
View on GitHub
☆16May 2, 2026Updated 2 months ago
rycolab / odpo
View on GitHub
This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).
☆21Feb 17, 2025Updated last year
numb0824 / wall-following-robot
View on GitHub
寻墙算法，ros-melodic，读取laserscan msg，使用两个PID来控制距离和角度
☆17Nov 26, 2020Updated 5 years ago
Rose-STL-Lab / V2V-traffic-forecast
View on GitHub
L4DC2021 code repository
☆14Apr 14, 2021Updated 5 years ago
aimotive / aimotive-dataset-loader
View on GitHub
Dataset loader and renderer for aiMotive Multimodal Dataset
☆12Oct 3, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
panyxy / hlgp_cvrp
View on GitHub
This is the official PyTorch implementation for the HLGP algorithm used to solve large-scale CVRP.
☆11May 24, 2026Updated 2 months ago
caas-team / caas-carbon-footprint
View on GitHub
Support Sustainable Computing to provide customer with metrics for their carbon footprint workload
☆14Mar 26, 2026Updated 4 months ago
824728350 / Zodiac
View on GitHub
Zodiac: Unearthing Semantic Checks for Cloud Infrastructure-as-Code Programs, SOSP 2024
☆15Nov 28, 2024Updated last year
sina33 / heft
View on GitHub
HEFT and CPOP task scheduling algorithms
☆12Dec 6, 2018Updated 7 years ago
uwplse / stng
View on GitHub
compiler for fortran stencils using verified lifting,
☆20Apr 5, 2022Updated 4 years ago
uArm-Developer / UF_uArm_Metal
View on GitHub
Arduino Libraries
☆14Jul 13, 2018Updated 8 years ago
marmotlab / PAN-CAS
View on GitHub
☆11May 5, 2026Updated 2 months ago
bikaldev / MARL-TrafficLight
View on GitHub
A Spatio-Temporal Multi-Agent Reinforcement Learning algorithm for cooperative traffic signal control.
☆20Feb 2, 2024Updated 2 years ago
harpribot / representation-music
View on GitHub
Deep Learning - Multi-Task Representation Learning using Shared Architecture for Deep Neural Networks
☆19Apr 11, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
xiaoxiaxusummer / PASS_Discrete
View on GitHub
Code Repository for Pinching-Antenna Systems (PASS): Power Radiation Model and Optimal Beamforming Design, published by IEEE TCOM https:/…
☆16Mar 19, 2026Updated 4 months ago
JhengLu / OpenInfra
View on GitHub
Simulator for the datacenter, including power, cooling, server and other components
☆19Feb 12, 2025Updated last year
SCH1001 / SLNR
View on GitHub
A super lightweight neural representation for large-scale 3D mapping
☆21May 31, 2026Updated last month
priest-yang / Nuplan-VAD-DataGen
View on GitHub
A Data Converter for Nuplan and VAD(VADv2)
☆24Nov 26, 2024Updated last year
GKthom / DeepQnetworks
View on GitHub
MATLAB implementation of DQN for a navigation environment
☆13Aug 13, 2020Updated 5 years ago
GC-Advising-Center / DD-Wiki
View on GitHub
An unofficial Wiki for UM-SJTU JI Dual-Degree Program.
☆17Mar 27, 2023Updated 3 years ago
kylewray / nova
View on GitHub
CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.
☆18Jun 18, 2021Updated 5 years ago
ZWLab23 / Delay-and-Battery-Degradation-Optimization-based-on-PPO-for-Task-Offloading-in-RSU-assisted-IoV
View on GitHub
☆16Nov 10, 2023Updated 2 years ago
gaomingqi / VOS-Review
View on GitHub
Datasets and Papers (with codes) discussed in "Deep Learning for Video Object Segmentation: A Review", Artificial Intelligence Review, 20…
☆54Oct 30, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AntNLP / undergraduates-seminar
View on GitHub
seminar for undergraduates
☆16Jun 8, 2021Updated 5 years ago
rachitmehrotra1 / TSP-GPU
View on GitHub
Solving the Travelers Salesman Problem using GPU ( Cuda ) using ANT and GA algorithms
☆13Dec 17, 2017Updated 8 years ago
davidkerkkamp / DQN-GNN
View on GitHub
Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets
☆17Jan 15, 2022Updated 4 years ago
WadeYin9712 / UI-Simulator
View on GitHub
Code for 🌍 UI-Simulator: LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training
☆21Oct 17, 2025Updated 9 months ago
AceCoooool / algs4cplusplus
View on GitHub
Algorithms, 4th edition textbook code (using c++)
☆15Oct 2, 2020Updated 5 years ago
eliabntt / animated_human_SMPL_to_USD
View on GitHub
Code used in the GRADE framework to convert SMPL animation data to the USD file format to be used in the IsaacSim/Omniverse simulators.
☆21Jun 28, 2023Updated 3 years ago
cloud-ark / kubediscovery
View on GitHub
Discover run time relationships between Kubernetes resources
☆21Mar 22, 2024Updated 2 years ago
blacksmithop / LLM-Graph-Builder
View on GitHub
Build Neo4J Knowledge Graphs from Excel files
☆23Nov 18, 2024Updated last year
harpribot / IRL-maxent
View on GitHub
Experiments showing effects of parameters on Maximum Entropy Inverse Reinforcement Learning using grid world
☆15Nov 26, 2016Updated 9 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
weizhenFrank / WeakNucleiSeg
View on GitHub
The code of WEAKLY SUPERVISED NUCLEI SEGMENTATION VIA INSTANCE LEARNING
☆17Apr 10, 2023Updated 3 years ago
floodsung / DDPG-tensorflow
View on GitHub
DDPG on OpenAI Gym Pendulum
☆17Jul 1, 2016Updated 10 years ago
ssscassio / ros-wall-follower-2-wheeled-robot
View on GitHub
Wall following robot using ROS and Python
☆33May 27, 2019Updated 7 years ago
MatthewRajan13 / GNN-RL-Stock-Predictor
View on GitHub
Creating a graph that summarizes correlations between stocks and using a Graph Neural Network to encode that information to be utilized i…
☆18May 19, 2023Updated 3 years ago
lalwdl / LBM-Fluid-Structure-Interaction
View on GitHub
Couette flow and Poiseuille flow
☆21Jan 6, 2024Updated 2 years ago
rudolfsteiner / DAgger
View on GitHub
Reinforcement Learning -- Imitation Learning, Behavior Cloning, DAgger (Data Aggregation)
☆22Apr 15, 2018Updated 8 years ago
zhiguo-ding / pinching_antennas
View on GitHub
☆27Dec 4, 2024Updated last year