Examples and codes for the RL book
☆12Aug 20, 2024Updated last year
Alternatives and similar repositories for Introduction-to-Reinforcement-Learning-with-Examples-and-Codes
Users that are interested in Introduction-to-Reinforcement-Learning-with-Examples-and-Codes are comparing it to the libraries listed below.
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).☆18Feb 17, 2025Updated last year
- Dataset loader and renderer for aiMotive Multimodal Dataset☆12Oct 3, 2025Updated 6 months ago
- Wall-following algorithm, ros-melodic; reads laserscan msg and uses two PID controllers to control distance and angle☆17Nov 26, 2020Updated 5 years ago
- L4DC2021 code repository☆14Apr 14, 2021Updated 5 years ago
- Zodiac: Unearthing Semantic Checks for Cloud Infrastructure-as-Code Programs, SOSP 2024☆15Nov 28, 2024Updated last year
- Compiler for Fortran stencils using verified lifting☆20Apr 5, 2022Updated 4 years ago
- Support Sustainable Computing to provide customer with metrics for their carbon footprint workload☆14Mar 26, 2026Updated 3 weeks ago
- This is the official PyTorch implementation for the HLGP algorithm used to solve large-scale CVRP.☆10Feb 13, 2025Updated last year
- ☆20Dec 4, 2024Updated last year
- Arduino Libraries☆14Jul 13, 2018Updated 7 years ago
- Simulator for the datacenter, including power, cooling, server and other components☆17Feb 12, 2025Updated last year
- ☆11Feb 7, 2024Updated 2 years ago
- Deep Learning - Multi-Task Representation Learning using Shared Architecture for Deep Neural Networks☆19Apr 11, 2017Updated 9 years ago
- A Data Converter for Nuplan and VAD(VADv2)☆24Nov 26, 2024Updated last year
- HEFT and CPOP task scheduling algorithms☆12Dec 6, 2018Updated 7 years ago
- The implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models☆36Apr 8, 2026Updated last week
- Reinforcement Learning -- Imitation Learning, Behavior Cloning, DAgger (Data Aggregation)☆22Apr 15, 2018Updated 8 years ago
- Perceptive Learning for Legged Robots in IsaacLab. | LocoTouch: Learning Dynamic Quadrupedal Transport with Tactile Sensing (CoRL'25)☆57Sep 18, 2025Updated 6 months ago
- A Spatio-Temporal Multi-Agent Reinforcement Learning algorithm for cooperative traffic signal control.☆19Feb 2, 2024Updated 2 years ago
- An unofficial Wiki for UM-SJTU JI Dual-Degree Program.☆17Mar 27, 2023Updated 3 years ago
- Datasets and Papers (with codes) discussed in "Deep Learning for Video Object Segmentation: A Review", Artificial Intelligence Review, 20…☆54Oct 30, 2023Updated 2 years ago
- Discover run time relationships between Kubernetes resources☆21Mar 22, 2024Updated 2 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Jun 18, 2021Updated 4 years ago
- ☆16Nov 10, 2023Updated 2 years ago
- Solving the Travelling Salesman Problem on GPU (CUDA) using ANT and GA algorithms☆13Dec 17, 2017Updated 8 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆17Jan 15, 2022Updated 4 years ago
- Algorithms, 4th edition textbook code (using c++)☆15Oct 2, 2020Updated 5 years ago
- MATLAB implementation of DQN for a navigation environment☆13Aug 13, 2020Updated 5 years ago
- Experiments showing effects of parameters on Maximum Entropy Inverse Reinforcement Learning using grid world☆15Nov 26, 2016Updated 9 years ago
- Build Neo4J Knowledge Graphs from Excel files☆23Nov 18, 2024Updated last year
- seminar for undergraduates☆15Jun 8, 2021Updated 4 years ago
- The code of WEAKLY SUPERVISED NUCLEI SEGMENTATION VIA INSTANCE LEARNING☆17Apr 10, 2023Updated 3 years ago
- Build a knowledge graph from UMLS Knowledge Sources (2022) with load, visualize and query with Neo4j and Scispacy☆25Sep 7, 2022Updated 3 years ago
- Creating a graph that summarizes correlations between stocks and using a Graph Neural Network to encode that information to be utilized i…☆18May 19, 2023Updated 2 years ago
- ☆46Aug 20, 2025Updated 7 months ago
- DDPG on OpenAI Gym Pendulum☆17Jul 1, 2016Updated 9 years ago
- Wall following robot using ROS and Python☆32May 27, 2019Updated 6 years ago
- ☆117Dec 30, 2025Updated 3 months ago
- Unsupervised Image Segmentation using WNet☆19Mar 28, 2018Updated 8 years ago