Multi-wavelength / Introduction-to-Reinforcement-Learning-with-Examples-and-Codes
Examples and codes for the RL book
☆12 · Aug 20, 2024 · Updated last year
Alternatives and similar repositories for Introduction-to-Reinforcement-Learning-with-Examples-and-Codes
Users who are interested in Introduction-to-Reinforcement-Learning-with-Examples-and-Codes are comparing it to the libraries listed below:
- Support Sustainable Computing by providing customers with metrics for their workloads' carbon footprint ☆13 · Nov 22, 2025 · Updated 2 months ago
- HEFT and CPOP task scheduling algorithms ☆12 · Dec 6, 2018 · Updated 7 years ago
- ☆20 · Dec 4, 2024 · Updated last year
- The implementation of ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models ☆18 · Jan 30, 2026 · Updated 2 weeks ago
- ☆11 · Feb 7, 2024 · Updated 2 years ago
- The official PyTorch implementation of the HLGP algorithm for solving large-scale CVRP ☆10 · Feb 13, 2025 · Updated last year
- Zodiac: Unearthing Semantic Checks for Cloud Infrastructure-as-Code Programs, SOSP 2024 ☆15 · Nov 28, 2024 · Updated last year
- Dataset loader and renderer for aiMotive Multimodal Dataset ☆11 · Oct 3, 2025 · Updated 4 months ago
- L4DC2021 code repository ☆15 · Apr 14, 2021 · Updated 4 years ago
- Simulator for the datacenter, including power, cooling, server and other components ☆17 · Feb 12, 2025 · Updated last year
- ☆15 · Nov 10, 2023 · Updated 2 years ago
- A Spatio-Temporal Multi-Agent Reinforcement Learning algorithm for cooperative traffic signal control ☆19 · Feb 2, 2024 · Updated 2 years ago
- Arduino Libraries ☆14 · Jul 13, 2018 · Updated 7 years ago
- Compiler for Fortran stencils using verified lifting ☆19 · Apr 5, 2022 · Updated 3 years ago
- Deep Learning - Multi-Task Representation Learning using Shared Architecture for Deep Neural Networks ☆19 · Apr 11, 2017 · Updated 8 years ago
- Solving the Traveling Salesman Problem on the GPU (CUDA) using ANT and GA algorithms ☆13 · Dec 17, 2017 · Updated 8 years ago
- MATLAB implementation of DQN for a navigation environment ☆13 · Aug 13, 2020 · Updated 5 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets ☆17 · Jan 15, 2022 · Updated 4 years ago
- Wall-following algorithm for ros-melodic; reads laserscan msg and uses two PIDs to control distance and angle ☆17 · Nov 26, 2020 · Updated 5 years ago
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO) ☆18 · Feb 17, 2025 · Updated 11 months ago
- A Data Converter for Nuplan and VAD (VADv2) ☆23 · Nov 26, 2024 · Updated last year
- Couette flow and Poiseuille flow ☆20 · Jan 6, 2024 · Updated 2 years ago
- Algorithms, 4th edition textbook code (using C++) ☆15 · Oct 2, 2020 · Updated 5 years ago
- Seminar for undergraduates ☆15 · Jun 8, 2021 · Updated 4 years ago
- Reinforcement Learning: Imitation Learning, Behavior Cloning, DAgger (Data Aggregation) ☆21 · Apr 15, 2018 · Updated 7 years ago
- Experiments showing effects of parameters on Maximum Entropy Inverse Reinforcement Learning using grid world ☆15 · Nov 26, 2016 · Updated 9 years ago
- An unofficial Wiki for UM-SJTU JI Dual-Degree Program ☆17 · Mar 27, 2023 · Updated 2 years ago
- Code for Weakly Supervised Nuclei Segmentation via Instance Learning ☆17 · Apr 10, 2023 · Updated 2 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs ☆18 · Jun 18, 2021 · Updated 4 years ago
- Creating a graph that summarizes correlations between stocks and using a Graph Neural Network to encode that information to be utilized i… ☆17 · May 19, 2023 · Updated 2 years ago
- Build a knowledge graph from UMLS Knowledge Sources (2022) with load, visualize and query with Neo4j and Scispacy ☆25 · Sep 7, 2022 · Updated 3 years ago
- Perceptive Learning for Legged Robots in IsaacLab. | LocoTouch: Learning Dynamic Quadrupedal Transport with Tactile Sensing (CoRL'25) ☆54 · Sep 18, 2025 · Updated 4 months ago
- libpomdp is a set of POMDP approximation algorithms implemented in Java and Matlab ☆29 · Jul 22, 2014 · Updated 11 years ago
- Discover run time relationships between Kubernetes resources ☆21 · Mar 22, 2024 · Updated last year
- ☆23 · Jul 11, 2023 · Updated 2 years ago
- Unsupervised Image Segmentation using WNet ☆19 · Mar 28, 2018 · Updated 7 years ago
- Decomposition Based Multi-Objective Particle Swarm Optimization ☆27 · Jan 11, 2017 · Updated 9 years ago
- Build Neo4J Knowledge Graphs from Excel files ☆22 · Nov 18, 2024 · Updated last year
- DDPG on OpenAI Gym Pendulum ☆17 · Jul 1, 2016 · Updated 9 years ago