niejnan / RL

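Course notes for Shanghai Jiao Tong University's "Hands-on Reinforcement Learning" (《动手学强化学习》), with implementations of all the algorithms, including but not limited to Actor-Critic, PPO, DDPG, and DQN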

☆20

Alternatives and similar repositories for RL

Users who are interested in RL are comparing it to the repositories listed below

IceRain-y / Robot-Control-VirtualProtoType
The simulation of various types of robot control systems is conducted by using Simulink, focusing on robot configuration design, kinemati…
☆16 Updated 10 months ago
SMARTlab-Purdue / PrefMMT
This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w…
☆49 Updated 4 months ago
RoboCodeX-source / RoboCodeX_code
☆248 Updated 6 months ago
Yuxing-Wang-THU / SurveyBrainBody
Brain-Body Co-Design in Embodied Intelligence: Taxonomy, Frontiers, and Challenges
☆195 Updated last month
RoyZry98 / RepCaM-Pytorch
[TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery
☆53 Updated 2 months ago
HITSZ-Robotics / DiffusionPolicy-Robotics
This repository contains a collection of resources and papers on Diffusion Models for Robotic Manipulation.
☆524 Updated last week
louhz / robogs
☆186 Updated last week
DaoyuanLi2816 / Kaggle-4th-Place-Solution-LMSYS-Chatbot-Arena-Human-Preference-Predictions
4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions
☆174 Updated 8 months ago
niejnan / LLaVA
A lightweight multimodal model for stylized question answering, built on Qwen2-0.5B and SigLIP
☆27 Updated 3 months ago
RoyZry98 / BEVUDA-Pytorch
[ICRA 2024] Official code for BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection
☆2 Updated last year
OpenHUTB / hutb
Pedestrian and vehicle simulator
☆83 Updated this week
RoyZry98 / VeCAF-Pytorch
[MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness
☆49 Updated 11 months ago
WaveSpeedAI / idea2product
☆204 Updated 3 weeks ago
chengji253 / RVO2-python
Optimal Reciprocal Collision Avoidance (ORCA) - velocity obstacle
☆46 Updated last month
EMI-Group / metade
MetaDE is a GPU-accelerated evolutionary framework that optimizes Differential Evolution (DE) strategies via meta-level evolution. Suppor…
☆112 Updated 3 months ago
DuyutongDockBlocks16 / meta-xr-hitscollider-optimizer
Inspired by Recognition and Estimation of Human Finger Pointing (Authors: Eran Bamani, Eden Nissinman, Lisa Koenigsberg, Inbar Meir, Yoa…
☆83 Updated 3 months ago
easonwangzk / site
☆50 Updated 3 months ago
Riley702 / outlier-detection-tool
Outlier Detection Tool: A Python package using Cook's Distance to detect outliers in (x, y) datasets, providing actionable results for pr…
☆172 Updated last month
Bill-Bi / RL_ramp_merging
☆121 Updated 2 weeks ago
LouisXO / FlightAbove
FlightAbove transforms your Mac's menu bar into a real-time aviation radar, showing you exactly what aircraft are flying nearby.
☆42 Updated this week
UlionTse / exejs
Run JavaScript code from Python. 「髯祭司」 is a library for running JavaScript from Python.
☆105 Updated 4 months ago
TianciGao / DiffPPO
Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning
☆121 Updated 3 weeks ago
oioi-code / GoverDue
GoverDue is a blockchain-based government management smart contract that enables transparent and decentralized administration of governme…
☆28 Updated 3 months ago
mr-yangfanqihang1 / Map
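Spring project: a personalized navigation service with configurable weights for time, price, and distance, which also updates road conditions and estimated arrival times based on the driving status of a large number of users
☆22 Updated 2 months ago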
Sijie-Yang / UrbanCode
A Python package for street view image perception analysis, providing tools for feature extraction and comfort prediction.
☆65 Updated 3 months ago
zxc-tju / InterHub
A naturalistic trajectory dataset with dense driving interactions and the toolbox for driving interaction extraction.
☆125 Updated 2 weeks ago
COLA-Laboratory / TransOPT
Transfer Optimization System for Black-box Optimization
☆251 Updated 7 months ago
SSCT-Lab / DFauLo
☆23 Updated 9 months ago
XenonJuice / Livonia
A lightweight web server implemented in Java
☆14 Updated last month
IceRain-y / Sim_HumanoidRobot_Loong
Using an open-source humanoid robot project for secondary development, we will conduct a series of operations such as embodied intelligen…
☆13 Updated 3 months ago