junnannie / RL
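Course notes for Shanghai Jiao Tong University's "Hands-on Reinforcement Learning" (《动手学强化学习》), with all algorithms implemented, including but not limited to Actor-Critic, PPO, DDPG, and DQN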
☆31 Updated 7 months ago
Alternatives and similar repositories for RL
Users who are interested in RL are comparing it to the libraries listed below
- [IROS 2025] MarineGym: A High-Performance Reinforcement Learning Platform for Underwater Robotics ☆88 Updated 3 weeks ago
- The simulation of various types of robot control systems is conducted by using Simulink, focusing on robot configuration design, kinemati… ☆16 Updated last year
- Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challenges ☆236 Updated this week
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w… ☆50 Updated 8 months ago
- Optimal Reciprocal Collision Avoidance (ORCA) - velocity obstacle ☆52 Updated 5 months ago
- Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human … ☆360 Updated 2 weeks ago
- ☆247 Updated 10 months ago
- ☆26 Updated 7 months ago
- [TMC 2025/NOSSDAV 2023] Official code for RepCaM++ and RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery ☆54 Updated 6 months ago
- Awesome AI for Electricity ☆115 Updated 7 months ago
- A collection of diffusion inversion methods. ☆97 Updated 2 months ago
- Decentralized LLMs fine-tuning and inference with offloading ☆102 Updated this week
- EvoMO is a GPU-accelerated library for evolutionary multiobjective optimization (EMO) ☆167 Updated last month
- MetaDE is a GPU-accelerated evolutionary framework that optimizes Differential Evolution (DE) strategies via meta-level evolution. Suppor… ☆174 Updated 7 months ago
- ☆544 Updated 2 weeks ago
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions ☆171 Updated last year
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System ☆99 Updated 3 months ago
- 🏕️ Hands-on Golang server-side basics (in Chinese) ☆40 Updated 3 months ago
- Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning ☆130 Updated last month
- Real-time object detection for UAV aerial photography ☆62 Updated last month
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity ☆111 Updated 5 months ago
- EvoRL is a fully GPU-accelerated framework for Evolutionary Reinforcement Learning, implemented with JAX. It supports Reinforcement Learn… ☆208 Updated last month
- ☆58 Updated 2 months ago
- [ACM MM 2025] Official implementation of "DualSG: A Dual-Stream Explicit Semantic-Guided Multivariate Time Series Forecasting Framework" ☆94 Updated 2 weeks ago
- Llama from scratch in CUDA with Flash Attention. ☆44 Updated 3 weeks ago
- ☆55 Updated 4 months ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation" ☆242 Updated 2 weeks ago
- highly customizable laser control CAD software designed for industrial-grade laser processing, precision positioning, and automation cont… ☆164 Updated last month
- [ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems ☆176 Updated 4 months ago
- Dataset and evaluation code of ISDrama (ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting ☆234 Updated 2 months ago