junnannie / RLLinks
上海交通大学《动手学强化学习》课程笔记,完成了所有算法实现,包括但不限于 Actor-Critic、PPO、DDPG、DQN等
☆23Updated 5 months ago
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below
Sorting:
- The simulation of various types of robot control systems is conducted by using Simulink, focusing on robot configuration design, kinemati…☆16Updated last year
- Brain-Body Co-Design in Embodied Intelligence: Taxonomy, Frontiers, and Challenges☆195Updated 2 months ago
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w…☆49Updated 6 months ago
- Optimal Reciprocal Collision Avoidance (ORCA) - velocity obstacle☆47Updated 3 months ago
- Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning☆125Updated this week
- ☆246Updated 7 months ago
- ☆534Updated last month
- [TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery☆56Updated 4 months ago
- MetaDE is a GPU-accelerated evolutionary framework that optimizes Differential Evolution (DE) strategies via meta-level evolution. Suppor…☆140Updated 5 months ago
- ☆22Updated 3 weeks ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆340Updated last month
- Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human …☆158Updated last week
- Awesome collection of resources and papers on Diffusion Models for Robotic Manipulation.☆664Updated last week
- Using an open-source humanoid robot project for secondary development, we will conduct a series of operations such as embodied intelligen…☆15Updated 5 months ago
- Awesome AI for Electricity☆110Updated 5 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆172Updated 10 months ago
- ☆26Updated 5 months ago
- 基于 Qwen2-0.5B 以及 SigLIP 实现的轻量化多模态风格化问答大模型☆28Updated 3 weeks ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆52Updated last year
- Any-step Dynamics Model for Policy Optimization☆60Updated 6 months ago
- Decentralized LLMs fine-tuning and inference with offloading☆98Updated last week
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity☆109Updated 2 months ago
- 🏕️ 动手学 Golang 服务端基础(中文)☆40Updated 2 weeks ago
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System☆90Updated 3 weeks ago
- ☆50Updated 5 months ago
- 提供项目中常用的工具函数,比如时间戳、格式的转换、数据类型判断等。如名字screw一样,做一个项目开发过程中的螺丝钉。☆47Updated 2 weeks ago
- EvoMO is a GPU-accelerated library for evolutionary multiobjective optimization (EMO)☆130Updated 2 months ago
- An Agent development framework that integrates MCP☆60Updated 5 months ago
- ☆53Updated last month
- ☆120Updated last month