junnannie / RLLinks
上海交通大学《动手学强化学习》课程笔记,完成了所有算法实现,包括但不限于 Actor-Critic、PPO、DDPG、DQN等
☆22Updated 4 months ago
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below
Sorting:
- The simulation of various types of robot control systems is conducted by using Simulink, focusing on robot configuration design, kinemati…☆16Updated 11 months ago
- Brain-Body Co-Design in Embodied Intelligence: Taxonomy, Frontiers, and Challenges☆195Updated last month
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w…☆49Updated 5 months ago
- ☆248Updated 6 months ago
- [TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery☆53Updated 3 months ago
- Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning☆124Updated last month
- 基于 Qwen2-0.5B 以及 SigLIP 实现的轻量化多模态风格化问答大模型☆28Updated this week
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆174Updated 9 months ago
- ☆528Updated 2 weeks ago
- This repository contains a collection of resources and papers on Diffusion Models for Robotic Manipulation.☆607Updated last month
- Optimal Reciprocal Collision Avoidance (ORCA) - velocity obstacle☆46Updated 2 months ago
- 人车模拟器☆218Updated this week
- MetaDE is a GPU-accelerated evolutionary framework that optimizes Differential Evolution (DE) strategies via meta-level evolution. Suppor…☆125Updated 4 months ago
- Any-step Dynamics Model for Policy Optimization☆60Updated 5 months ago
- Using an open-source humanoid robot project for secondary development, we will conduct a series of operations such as embodied intelligen…☆13Updated 4 months ago
- ☆50Updated 4 months ago
- A naturalistic trajectory dataset with dense driving interactions and the toolbox for driving interaction extraction.☆127Updated last month
- ☆26Updated 4 months ago
- ☆37Updated last month
- 提供项目中常用的工具函数,比如时间戳、格式的转换、数据类型判断等。如名字screw一样,做一个项目开发过程中的螺丝钉。☆48Updated 3 weeks ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆49Updated last year
- Inspired by Recognition and Estimation of Human Finger Pointing (Authors: Eran Bamani, Eden Nissinman, Lisa Koenigsberg, Inbar Meir, Yoa…☆83Updated 4 months ago
- ☆14Updated 7 months ago
- A Python package for street view image perception analysis, providing tools for feature extraction and comfort prediction.☆79Updated 3 months ago
- Residual Kolmogorov-Arnold Network (RKAN) is designed to enhance the performance of classic deep learning models.☆272Updated last week
- ☆121Updated last month
- 53AI Hub is an open-source AI portal, which enables you to quickly build a operational-level AI portal to launch and operate AI agents, p…☆82Updated 2 weeks ago
- Awesome AI for Electricity☆47Updated 4 months ago
- ☆281Updated last month
- Implementation of the paper Koopman Embedded Equivariant Control☆40Updated 5 months ago