junnannie / RLLinks
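Course notes for Shanghai Jiao Tong University's "Hands-on Reinforcement Learning" (动手学强化学习), with all algorithms implemented, including but not limited to Actor-Critic, PPO, DDPG, and DQN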
☆28Updated 7 months ago
Alternatives and similar repositories for RL
Users who are interested in RL are comparing it to the repositories listed below
- The simulation of various types of robot control systems is conducted by using Simulink, focusing on robot configuration design, kinemati…☆16Updated last year
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w…☆50Updated 7 months ago
- Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challenges☆214Updated last week
- Optimal Reciprocal Collision Avoidance (ORCA) - velocity obstacle☆50Updated 5 months ago
- Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human …☆356Updated 3 weeks ago
- Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning☆126Updated last week
- ☆245Updated 9 months ago
- Real-time object detection for UAV aerial photography☆60Updated 2 weeks ago
- Any-step Dynamics Model for Policy Optimization☆61Updated 8 months ago
- ☆26Updated 6 months ago
- ☆539Updated 2 months ago
- 🏕️ Hands-on Golang server-side fundamentals (in Chinese)☆40Updated 2 months ago
- A lightweight multimodal large model for stylized question answering, built on Qwen2-0.5B and SigLIP☆28Updated 2 months ago
- Awesome AI for Electricity☆113Updated 6 months ago
- [TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery☆54Updated 5 months ago
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System☆94Updated 2 months ago
- MetaDE is a GPU-accelerated evolutionary framework that optimizes Differential Evolution (DE) strategies via meta-level evolution. Suppor…☆159Updated 6 months ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆171Updated 11 months ago
- ☆54Updated 3 months ago
- Using an open-source humanoid robot project for secondary development, we will conduct a series of operations such as embodied intelligen…☆16Updated 6 months ago
- ☆20Updated 4 months ago
- A problem-solving and interview assistant that uses AI to provide real-time solution approaches and answers during coding tests and interviews.☆220Updated 3 weeks ago
- Decentralized LLMs fine-tuning and inference with offloading☆99Updated last week
- EvoRL is a fully GPU-accelerated framework for Evolutionary Reinforcement Learning, implemented with JAX. It supports Reinforcement Learn…☆190Updated 3 weeks ago
- [MM 2025] EventVAD: Training-Free Event-Aware Video Anomaly Detection☆503Updated 3 months ago
- EvoMO is a GPU-accelerated library for evolutionary multiobjective optimization (EMO)☆152Updated 3 months ago
- A naturalistic trajectory dataset with dense driving interactions and the toolbox for driving interaction extraction.☆133Updated 2 months ago
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity☆109Updated 4 months ago
- [COLM 2025] Assessing Judging Bias in Large Reasoning Models: An Empirical Study https://arxiv.org/abs/2504.09946☆164Updated 3 weeks ago
- A powerful multi-format file parsing, data cleaning, and AI annotation toolkit.☆140Updated this week