上海交通大学《动手学强化学习》课程笔记,完成了所有算法实现,包括但不限于 Actor-Critic、PPO、DDPG、DQN等
☆40Mar 19, 2025Updated last year
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This section provides guidance on using CANopen to connect eRob in ROS2. For related examples, please refer to the sample code provided o…☆18Mar 3, 2025Updated last year
- ☆12Apr 26, 2023Updated 2 years ago
- ☆10Sep 23, 2019Updated 6 years ago
- ☆20Apr 12, 2025Updated 11 months ago
- 本项目对Deepseek-R1-Distill-Qwen-7B进行心理咨询CoT数据的LoRA微调,以进一步提升Deepseek-R1-Distill-Qwen-7B在心理咨询领域的慢思考能力。☆12Mar 11, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆24Dec 2, 2025Updated 3 months ago
- ☆13Sep 27, 2020Updated 5 years ago
- Path planning of multi-agent-system for UAV use☆18Mar 20, 2023Updated 3 years ago
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".☆23Jun 9, 2025Updated 9 months ago
- ☆14Jul 23, 2021Updated 4 years ago
- PixShark系列水下机器人开源固件代码☆20Mar 13, 2024Updated 2 years ago
- Solving MuJoCo environments with Deep Deterministic Policy Gradients☆14Sep 17, 2018Updated 7 years ago
- A SITL guide for setting up Ardupilot, Gazebo & ROS☆16Jul 27, 2020Updated 5 years ago
- Modbus TCP, Modbus UDP, Modbus Ascii and Modbus RTU client/server library for .NET implementations☆12May 29, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Jun 6, 2022Updated 3 years ago
- Huggingface PPO Demo☆25Sep 7, 2025Updated 6 months ago
- Set of environments to test various MoveIt! motion planning algorithms on the Baxter robot☆13Jun 25, 2019Updated 6 years ago
- ☆14Oct 27, 2025Updated 5 months ago
- The official code of our paper “RAG-Critic: Leveraging Automated Critic-Guided Agentic Workflow for Retrieval Augmented Generation”☆27Aug 19, 2025Updated 7 months ago
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆30Updated this week
- ☆14Jun 26, 2019Updated 6 years ago
- A balance_car simulation depending on ros2,ros2_control,LQR and gazebo☆42Jul 23, 2025Updated 8 months ago
- The code for task allocation and the simulation system based on ROS and Gazebo for task allocation are included☆18Jul 15, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A Multi-Agent Approach Integrating Socratic Guidance for Automated Prompt Optimization☆18Dec 15, 2025Updated 3 months ago
- ros2 ethercat package(ethercat plugins + gpio controllers) for omron☆17Jan 24, 2023Updated 3 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- 基于 Python3 制作的“操作系统”☆12Oct 17, 2025Updated 5 months ago
- tg机器人 trx兑换、能量租赁、trx闪兑自动回能量,-完整功能 https://t.me/hongsx☆52Mar 17, 2026Updated last week
- Arm Obstacle Avoidance Using Potential Filed in Isaac Lab☆18Feb 19, 2025Updated last year
- ROS Software for AgileX Scout Mini Navigation☆16Sep 11, 2023Updated 2 years ago
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings☆39Sep 13, 2025Updated 6 months ago
- AHT30 full-featured driver library for general-purpose MCU and Linux.☆13Oct 25, 2025Updated 5 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆13Sep 14, 2022Updated 3 years ago
- [ICCV 2025] LocalDyGS : Multi-view Global Dynamic Scene Modeling through Adaptive Local Feature Decoupling☆114Feb 9, 2026Updated last month
- 一个集文档、代码实践于一体的技术知识库平台。包含文档、代码编辑、管理后台等5个应用的monorepo项目。采用Next.js、NestJS等现代技术栈,为开发者提供学习和实践平台。☆17Jul 21, 2025Updated 8 months ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- ROS2 gazebo simulator for underwater robots, including robot model, dynamics, controller, RL stuffs.☆20Mar 4, 2025Updated last year
- Package for control of ZeroErr eRob motors using ROS2 and EtherCAT + CIA402Drivers☆21Dec 16, 2024Updated last year
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆24Jan 4, 2026Updated 2 months ago