niejnan / RL
上海交通大学《动手学强化学习》课程笔记,完成了所有算法实现,包括但不限于 Actor-Critic、PPO、DDPG、DQN等
☆17Updated last month
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below
Sorting:
- The simulation of various types of robot control systems is conducted by using Simulink, focusing on robot configuration design, kinemati…☆16Updated 8 months ago
- Using an open-source humanoid robot project for secondary development, we will conduct a series of operations such as embodied intelligen…☆12Updated last month
- Brain-Body Co-Design for Embodied Agents: A Survey of Neural Approaches☆122Updated last month
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w…☆46Updated 2 months ago
- ☆245Updated 4 months ago
- Enabling robotic manipulators to learn to imitate human arm motions from given videos.☆47Updated last year
- This repository contains a collection of resources and papers on Diffusion Models for Robotic Manipulation.☆314Updated last week
- ☆14Updated 5 months ago
- ☆50Updated last month
- sisuolv / 2021--CCF-Big-Data-Computing-Intelligence-Contest--Script-character-emotion-recognition--5thhttps://www.datafountain.cn/competitions/518☆13Updated 2 years ago
- Advanced Driving Assistance System based on Jetson Nano☆82Updated 2 years ago
- [ICRA 2024] Official code for BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection☆2Updated 10 months ago
- ☆23Updated 7 months ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆49Updated 9 months ago
- The ROS simulation, navigation, learning and control of robot☆11Updated 9 months ago
- ☆28Updated 2 years ago
- [TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery☆52Updated 3 weeks ago
- sisuolv / 2021--CCF-Big-Data-Computing-Intelligence-Contest--System-authentication-risk-prediction--1sthttps://www.datafountain.cn/competitions/537☆19Updated 2 years ago
- ☆42Updated 3 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆173Updated 6 months ago
- Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning☆113Updated 2 months ago
- 提供项目中常用的工具函数,比如时间戳、格式的转换、数据类型判断等。如名字screw一样,做一个项目开发过程中的螺丝钉。☆48Updated this week
- Implementation of CNN network based on VHDL☆19Updated 2 months ago
- CarbonSolAI☆39Updated 2 months ago
- ☆52Updated 2 months ago
- ☆28Updated 2 weeks ago
- 基于 Qwen2-0.5B 以及 SigLIP 实现的轻量化多模态风格化问答大模型☆26Updated last month
- ☆22Updated 3 weeks ago
- Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)☆41Updated 3 months ago
- PCF8563 full-featured driver library for general MCU and Linux.☆30Updated last month