Examples and codes for the RL book
☆12Aug 20, 2024Updated last year
Alternatives and similar repositories for Introduction-to-Reinforcement-Learning-with-Examples-and-Codes
Users that are interested in Introduction-to-Reinforcement-Learning-with-Examples-and-Codes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).☆19Feb 17, 2025Updated last year
- Dataset loader and renderer for aiMotive Multimodal Dataset☆12Oct 3, 2025Updated 7 months ago
- 寻墙算法,ros-melodic,读取laserscan msg,使用两个PID来控制距离和角度☆17Nov 26, 2020Updated 5 years ago
- L4DC2021 code repository☆14Apr 14, 2021Updated 5 years ago
- Zodiac: Unearthing Semantic Checks for Cloud Infrastructure-as-Code Programs, SOSP 2024☆15Nov 28, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- compiler for fortran stencils using verified lifting,☆20Apr 5, 2022Updated 4 years ago
- Support Sustainable Computing to provide customer with metrics for their carbon footprint workload☆14Mar 26, 2026Updated last month
- This is the official PyTorch implementation for the HLGP algorithm used to solve large-scale CVRP.☆11Apr 29, 2026Updated last week
- ☆22Dec 4, 2024Updated last year
- Arduino Libraries☆14Jul 13, 2018Updated 7 years ago
- Simulator for the datacenter, including power, cooling, server and other components☆17Feb 12, 2025Updated last year
- ☆11Feb 7, 2024Updated 2 years ago
- Deep Learning - Multi-Task Representation Learning using Shared Architecture for Deep Neural Networks☆19Apr 11, 2017Updated 9 years ago
- A Data Converter for Nuplan and VAD(VADv2)☆24Nov 26, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- HEFT and CPOP task scheduling algorithms☆12Dec 6, 2018Updated 7 years ago
- The implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models☆57Apr 8, 2026Updated 3 weeks ago
- Reinforcement Learning -- Imitation Learning, Behavior Cloning, DAgger (Data Aggregation)☆22Apr 15, 2018Updated 8 years ago
- Perceptive Learning for Legged Robots in IsaacLab. | LocoTouch: Learning Dynamic Quadrupedal Transport with Tactile Sensing (CoRL'25)☆58Sep 18, 2025Updated 7 months ago
- A Spatio-Temporal Multi-Agent Reinforcement Learning algorithm for cooperative traffic signal control.☆19Feb 2, 2024Updated 2 years ago
- An unofficial Wiki for UM-SJTU JI Dual-Degree Program.☆17Mar 27, 2023Updated 3 years ago
- Datasets and Papers (with codes) discussed in "Deep Learning for Video Object Segmentation: A Review", Artificial Intelligence Review, 20…☆54Oct 30, 2023Updated 2 years ago
- Discover run time relationships between Kubernetes resources☆21Mar 22, 2024Updated 2 years ago
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Jun 18, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Nov 10, 2023Updated 2 years ago
- Solving the Travelers Salesman Problem using GPU ( Cuda ) using ANT and GA algorithms☆13Dec 17, 2017Updated 8 years ago
- Deep Reinforcement Learning framework that uses GNN to solve planning tasks for infrastructural assets☆17Jan 15, 2022Updated 4 years ago
- Algorithms, 4th edition textbook code (using c++)☆15Oct 2, 2020Updated 5 years ago
- MATLAB implementation of DQN for a navigation environment☆13Aug 13, 2020Updated 5 years ago
- Experiments showing effects of parameters on Maximum Entropy Inverse Reinforcement Learning using grid world☆15Nov 26, 2016Updated 9 years ago
- Build Neo4J Knowledge Graphs from Excel files☆23Nov 18, 2024Updated last year
- seminar for undergraduates☆15Jun 8, 2021Updated 4 years ago
- The code of WEAKLY SUPERVISED NUCLEI SEGMENTATION VIA INSTANCE LEARNING☆17Apr 10, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Build a knowledge graph from UMLS Knowledge Sources (2022) with load, visualize and query with Neo4j and Scispacy☆27Sep 7, 2022Updated 3 years ago
- Creating a graph that summarizes correlations between stocks and using a Graph Neural Network to encode that information to be utilized i…☆18May 19, 2023Updated 2 years ago
- DDPG on OpenAI Gym Pendulum☆17Jul 1, 2016Updated 9 years ago
- Wall following robot using ROS and Python☆32May 27, 2019Updated 6 years ago
- ☆48Aug 20, 2025Updated 8 months ago
- ☆122Dec 30, 2025Updated 4 months ago
- Unsupervised Image Segmentation using WNet☆19Mar 28, 2018Updated 8 years ago