niejnan / RL
上海交通大学《动手学强化学习》课程笔记,完成了所有算法实现,包括但不限于 Actor-Critic、PPO、DDPG、DQN等
☆16Updated last month
Alternatives and similar repositories for RL:
Users that are interested in RL are comparing it to the libraries listed below
- Using an open-source humanoid robot project for secondary development, we will conduct a series of operations such as embodied intelligen…☆12Updated last month
- The simulation of various types of robot control systems is conducted by using Simulink, focusing on robot configuration design, kinemati…☆15Updated 8 months ago
- Brain-Body Co-Design for Embodied Agents: A Survey of Neural Approaches☆118Updated last month
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w…☆46Updated last month
- Advanced Driving Assistance System based on Jetson Nano☆82Updated 2 years ago
- ☆51Updated last month
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆48Updated 9 months ago
- ☆14Updated 4 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆173Updated 6 months ago
- WebChat 是一个功能强大的 Chrome/Edge 浏览器 AI 问答插件,可以帮助您在浏览网页时快速引用划线选中的内容与 AI 进行交互问答。并且可以自定义工具名称和提示词,选中文本后点击自定义工具按钮,可以一键执行"引用+提示词+发送"的操作组合。☆41Updated last week
- ☆244Updated 3 months ago
- This repository contains a collection of resources and papers on Diffusion Models for Robotic Manipulation.☆191Updated this week
- ☆28Updated 2 years ago
- ☆51Updated this week
- Run JavaScript code from Python.☆101Updated last month
- [TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery☆52Updated this week
- CarbonSolAI☆39Updated last month
- [ICRA 2024] Official code for BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection☆2Updated 9 months ago
- ☆142Updated 11 months ago
- Awesome AI for Electricity☆41Updated 3 weeks ago
- 基于Go的goroutine及Go的并发编程实现的协程复用池☆68Updated last month
- 基于 Qwen2-0.5B 以及 SigLIP 实现的轻量化多模态风格化问答大模型☆26Updated last month
- 提供项目中常用的工具函数,比如时间戳、格式的转换、数据类型判断等。如名字screw一样,做一个项目开发过程中的螺丝钉。☆25Updated this week
- DeepRug☆40Updated 2 months ago
- 一个基于 UniApp 框架开发的算法可视化与应用的项目。☆23Updated last month
- ☆52Updated last month
- 第五届字节跳动青训营后端进阶班-大项目极简版抖音-基于Kitex + Hertz + Gorm 的分布式视频APP服务端☆43Updated last year
- Main Project of AIDE☆91Updated 2 months ago
- Deep Seek AI-Driven Strategies, Blockchain-Verified Trust☆41Updated 2 months ago
- Common 3rd party API Simulator with payment api demo. Utilize Spring boot, Redis, MySQL, Docker, Groovy, Velocity, etc.☆38Updated last month