针对最经典的表格型Q learning算法进行了复现,能够支持gym中大多数的离散动作和状态空间的环境,譬如CliffWalking-v0。
☆10Jan 2, 2021Updated 5 years ago
Alternatives and similar repositories for Q-learning
Users that are interested in Q-learning are comparing it to the libraries listed below
Sorting:
- OpenVLA Lightweight Version(0.5B). It uses qwen2-0.5B and fine-tunes using mllm format, without occupying LLM's inherent tokens. It repre…☆16Jan 7, 2026Updated last month
- ☆11Aug 9, 2018Updated 7 years ago
- Implementation of the paper Unsupervised Domain Adaptation by Backpropagation☆10Dec 1, 2018Updated 7 years ago
- RLCar Gazebo v2☆12Jun 28, 2024Updated last year
- Autonomous navigation simulation of an agricultural robot during soil fertilization in open fields using ROS and Gazebo.☆10Apr 8, 2025Updated 10 months ago
- sgbm立体匹配算法以及生成点云☆12Jan 29, 2021Updated 5 years ago
- Final Project of ME5413 Autonomous Mobile Robotics @ NUS☆10Oct 13, 2023Updated 2 years ago
- ☆13May 11, 2022Updated 3 years ago
- 戴西之海 - 先进数字集群:技术作者自留地☆12Jan 10, 2021Updated 5 years ago
- 电信采集项目 功能模块: 1、采集模块:对用户的使用信息进行定期数据采集。分为子服务器、中央服务器。子服务器解析计费信息并发送至中央服务器,中央服务器接受数据并插入数据库中由整合模块对数据进行整合处理。 2、整合模块:将采集模块发送的数据信息整合生成所有用户计费数据日表t…☆12Oct 29, 2017Updated 8 years ago
- Implementations of Influential Recommender System☆11Oct 29, 2024Updated last year
- A Three.js Start Kit with Webpack Hot Module Reload☆10Nov 27, 2017Updated 8 years ago
- ☆11Jan 13, 2022Updated 4 years ago
- SSGCN☆10Jul 23, 2020Updated 5 years ago
- 基于乐鑫 ESP32/ESP32-S2/S3 开发的小型无人机解决方案、基于北京理工大学自动化学院OLDX多旋翼开发平台(OLDX-FC)、基于正点原子ATK-F405☆21Apr 22, 2023Updated 2 years ago
- Code for Policy Bifurcation in Safe Reinforcement Learning☆10Jul 4, 2025Updated 8 months ago
- Localize the car in a static map with a particle filter.☆12Apr 2, 2025Updated 11 months ago
- Deep Q learning algorithm written on PyTorch for solving 2D robot arm reacher☆12Feb 19, 2020Updated 6 years ago
- Generalizable Stable Points Segmentation for 3D LiDAR Scan-to-Map Long-Term Localization☆17Jun 3, 2024Updated last year
- Semantic Lidar Odometry☆12May 1, 2020Updated 5 years ago
- RAL-2024, A key-frame based LiDAR global localization method.☆10Mar 23, 2024Updated last year
- CATIA&PDPS快速开发工具☆11Dec 8, 2022Updated 3 years ago
- Official Code Repository for the POLICEd-RL Paper: https://www.roboticsproceedings.org/rss20/p104.html☆13Mar 4, 2025Updated last year
- PhysReason Becnhmark☆19Jul 8, 2025Updated 7 months ago
- Code and data for paper named: Large language models for automatic equation discovery of nonlinear dynamics☆12Mar 6, 2025Updated 11 months ago
- 使用ROS2+RL 的循迹小车☆12Aug 30, 2024Updated last year
- Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning☆12Dec 20, 2020Updated 5 years ago
- 这个仓库用于在ros2 humble中整合九轴IMU和车轮Odom的数据,其中的参数是经过多次实验后自行调整的。☆14Jan 25, 2025Updated last year
- Executive control code for STRANDS robots.☆11Feb 13, 2020Updated 6 years ago
- This project demonstrates the use of a physics informed neural network to estimate the state of ground vehicles☆13May 28, 2024Updated last year
- Analyse Social Network of co-authors in DBLP website (https://dblp.uni-trier.de) using NetworkX.☆14May 27, 2020Updated 5 years ago
- MVC、MVVM、MVVC三种模式☆10Feb 16, 2017Updated 9 years ago
- ☆14Nov 2, 2025Updated 4 months ago
- ☆10Sep 23, 2021Updated 4 years ago
- 國際STEAM Maker Forum(STEAM) 創客論壇☆17Aug 5, 2016Updated 9 years ago
- mcp server for robot and automations☆12Feb 27, 2025Updated last year
- ☆11Jan 6, 2024Updated 2 years ago
- Deep Introspective SLAM: Deep Reinforcement Learning based Approach to Avoid Tracking Failure in Visual SLAM☆11Jul 31, 2021Updated 4 years ago
- ☆11Jul 1, 2024Updated last year