Examples and codes for the RL book
☆12Aug 20, 2024Updated last year
Alternatives and similar repositories for Introduction-to-Reinforcement-Learning-with-Examples-and-Codes
Users that are interested in Introduction-to-Reinforcement-Learning-with-Examples-and-Codes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).☆20Feb 17, 2025Updated last year
- Dataset loader and renderer for aiMotive Multimodal Dataset☆12Oct 3, 2025Updated 8 months ago
- 寻墙算法,ros-melodic,读取laserscan msg,使用两个PID来控制距离和角度☆17Nov 26, 2020Updated 5 years ago
- L4DC2021 code repository☆14Apr 14, 2021Updated 5 years ago
- Zodiac: Unearthing Semantic Checks for Cloud Infrastructure-as-Code Programs, SOSP 2024☆15Nov 28, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- compiler for fortran stencils using verified lifting,☆20Apr 5, 2022Updated 4 years ago
- This is the official PyTorch implementation for the HLGP algorithm used to solve large-scale CVRP.☆11May 24, 2026Updated 3 weeks ago
- Support Sustainable Computing to provide customer with metrics for their carbon footprint workload☆14Mar 26, 2026Updated 2 months ago
- ☆23Dec 4, 2024Updated last year
- Arduino Libraries☆14Jul 13, 2018Updated 7 years ago
- Simulator for the datacenter, including power, cooling, server and other components☆18Feb 12, 2025Updated last year
- ☆11May 5, 2026Updated last month
- Deep Learning - Multi-Task Representation Learning using Shared Architecture for Deep Neural Networks☆19Apr 11, 2017Updated 9 years ago
- A Data Converter for Nuplan and VAD(VADv2)☆24Nov 26, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- HEFT and CPOP task scheduling algorithms☆12Dec 6, 2018Updated 7 years ago
- The implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models☆60Apr 8, 2026Updated 2 months ago
- Reinforcement Learning -- Imitation Learning, Behavior Cloning, DAgger (Data Aggregation)☆22Apr 15, 2018Updated 8 years ago
- Perceptive Learning for Legged Robots in IsaacLab. | LocoTouch: Learning Dynamic Quadrupedal Transport with Tactile Sensing (CoRL'25)☆58May 15, 2026Updated last month
- A Spatio-Temporal Multi-Agent Reinforcement Learning algorithm for cooperative traffic signal control.☆19Feb 2, 2024Updated 2 years ago
- An unofficial Wiki for UM-SJTU JI Dual-Degree Program.☆17Mar 27, 2023Updated 3 years ago
- Datasets and Papers (with codes) discussed in "Deep Learning for Video Object Segmentation: A Review", Artificial Intelligence Review, 20…☆54Oct 30, 2023Updated 2 years ago
- Discover run time relationships between Kubernetes resources☆21Mar 22, 2024Updated 2 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Jun 18, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆16Nov 10, 2023Updated 2 years ago
- Solving the Travelers Salesman Problem using GPU ( Cuda ) using ANT and GA algorithms☆13Dec 17, 2017Updated 8 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆17Jan 15, 2022Updated 4 years ago
- Algorithms, 4th edition textbook code (using c++)☆15Oct 2, 2020Updated 5 years ago
- MATLAB implementation of DQN for a navigation environment☆13Aug 13, 2020Updated 5 years ago
- Experiments showing effects of parameters on Maximum Entropy Inverse Reinforcement Learning using grid world☆15Nov 26, 2016Updated 9 years ago
- Build Neo4J Knowledge Graphs from Excel files☆23Nov 18, 2024Updated last year
- seminar for undergraduates☆16Jun 8, 2021Updated 5 years ago
- The code of WEAKLY SUPERVISED NUCLEI SEGMENTATION VIA INSTANCE LEARNING☆17Apr 10, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Build a knowledge graph from UMLS Knowledge Sources (2022) with load, visualize and query with Neo4j and Scispacy☆27Sep 7, 2022Updated 3 years ago
- Creating a graph that summarizes correlations between stocks and using a Graph Neural Network to encode that information to be utilized i…☆18May 19, 2023Updated 3 years ago
- DDPG on OpenAI Gym Pendulum☆17Jul 1, 2016Updated 9 years ago
- Wall following robot using ROS and Python☆32May 27, 2019Updated 7 years ago
- ☆49Aug 20, 2025Updated 9 months ago
- ☆125Dec 30, 2025Updated 5 months ago
- ☆26Jul 11, 2023Updated 2 years ago