《强化学习-原理与Python实现》的Pytorch实现。
☆64Nov 30, 2020Updated 5 years ago
Alternatives and similar repositories for RL-Python-Pytorch
Users that are interested in RL-Python-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source codes for the book "Reinforcement Learning: Theory and Python Implementation"☆1,025Oct 26, 2025Updated 6 months ago
- 预测-校正学习计算制导律☆13Jun 22, 2021Updated 4 years ago
- Repository for Plexe Sumo☆14Aug 4, 2018Updated 7 years ago
- 🚖 3D model viewer with labels made with THREE.js☆13Dec 8, 2022Updated 3 years ago
- RAMRL: Towards Robust On-Ramp Merging via Augmented Multimodal Reinforcement Learning☆10Jul 10, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Introduction to surrogate modeling optimization in wireless networks☆10May 10, 2018Updated 8 years ago
- Application of model predictive control (MPC) on the highway-env simulator. Controller takes into account predicted trajectories for all …☆13Aug 8, 2023Updated 2 years ago
- Official documentation of I-24 MOTION data products☆21Oct 16, 2023Updated 2 years ago
- ☆22Feb 27, 2020Updated 6 years ago
- Szilard Bessenyei's submitted projects for the Udacity Self-Driving Car Engineer course.☆13Aug 29, 2019Updated 6 years ago
- Project for Elective in Robotics: Control of Multi-robot system, Univ. La Sapienza Roma, 2020.☆11Jan 25, 2021Updated 5 years ago
- This is the repo for the submitted paper "Automated Lane Merging via Game Theory and Branch Model Predictive Control".☆15Aug 14, 2024Updated last year
- This project uses computer vision in combination with a table football table to analyze the movement of the ball.☆12Apr 22, 2026Updated last month
- A vehicular network simulator (SUMO+ns-3) encapsulated behind Gym API (OpenAI-Gym), which allows users to evaluate RL-enhanced vehicular …☆40Oct 24, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Datasets and source codes for paper "Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability"☆17Nov 17, 2021Updated 4 years ago
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- 《动手学强化学习》练习代码(Pytorch)☆20Sep 16, 2022Updated 3 years ago
- A Multi-Operator Equivariant Framework for High-Performance Machine Learning Force Fields, supporting External Fields embedding and Physi…☆17May 9, 2026Updated 2 weeks ago
- ALSET Autonomous Vehicles with Full Self Driving Capabilities: Model S: A toy RC robot with arm and tracked wheels. Model X: a toy RC Ex…☆12Jan 12, 2024Updated 2 years ago
- ☆16Mar 19, 2022Updated 4 years ago
- Pose Estimation for drones☆20Oct 23, 2025Updated 7 months ago
- Official Github Repository for "Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints". (NeurIPS 2023)☆20Nov 30, 2025Updated 5 months ago
- An autonomous driving agent with a Safety model and the ATtention mechanism in a multi-task framework.☆15Jan 13, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Experimenting with infinite terrain generation using ROAM level of detail☆13Mar 13, 2018Updated 8 years ago
- [动手学强化学习]系列,基于pytorch。☆59Jun 2, 2021Updated 4 years ago
- Using two team game theory, we can make the truck platoons resilient against any malicious attack.☆15May 29, 2019Updated 6 years ago
- Ride Hailing Simulation - A data-driven approach to model a simulation environment☆13Feb 16, 2021Updated 5 years ago
- An educational tool to introduce AI planning concepts using mobile manipulator robots.☆15Aug 6, 2024Updated last year
- This repository contains the information related to the benchmark study on openly available OCSR tools☆45May 3, 2021Updated 5 years ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- PyRossGeo is a numerical library for spatially resolved mathematical modelling of infectious diseases in Python - https://github.com/luka…☆18Sep 22, 2020Updated 5 years ago
- This repository offers J2735 standard messages modules for Python based on Pyasn.1 library. Some other utility functions are also made av…☆12Aug 25, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆16May 20, 2021Updated 5 years ago
- [KI'22] Official implementation of the paper "Solving the Traveling Salesperson Problem with Precedence Constraints (TSPPC) by Deep Reinf…☆13Sep 19, 2022Updated 3 years ago
- An implementation of the Traffic Control Interface for Matlab☆16Jun 4, 2021Updated 4 years ago
- BiSeNet in pytorch☆10Oct 12, 2018Updated 7 years ago
- A PyTorch implementation of Fader Networks: Manipulating Images by Sliding Attributes by Lample et al.☆12Aug 27, 2017Updated 8 years ago
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago