johnjim0816 / joyrl-bookLinks
☆16Updated last year
Alternatives and similar repositories for joyrl-book
Users that are interested in joyrl-book are comparing it to the libraries listed below
Sorting:
- An easier PyTorch deep reinforcement learning library.☆236Updated 8 months ago
- 此项目中将上传我在B站《强化学习理论基础》系列视频中的板书、参考资料等内容。☆80Updated 2 years ago
- ☆66Updated last year
- ☆90Updated 3 years ago
- 📖 Full Stack Practice of the Large Language Model Training @ RLChina 2024☆40Updated 10 months ago
- Tutorial4RL: Tutorial for Reinforcement Learning. 强化学习入门教程.☆164Updated last year
- The mirror of RL_Coding_Exercise.☆103Updated last year
- basic algorithms of reinforcement learning☆213Updated 2 years ago
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆78Updated 4 months ago
- ☆168Updated last year
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆56Updated 4 years ago
- A collection of LLM with RL papers☆277Updated last year
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆17Updated last year
- GitHub's code repository is all you need☆355Updated 2 years ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆48Updated 5 months ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆376Updated last year
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago
- A curated list of RL resources☆44Updated 3 weeks ago
- 天授中文文档☆58Updated 8 months ago
- The code of paper Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic. Zhihai Wang, Jie Wang*, Qi Zhou, Bin…☆20Updated 3 years ago
- 强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition☆157Updated 4 years ago
- Research Papers and Code Repository on the Integration of Evolutionary Algorithms and Reinforcement Learning☆298Updated last year
- Baseline for NeurIPS_Auto_Bidding_General_Track☆35Updated last year
- Code for running RL experiments on continuing (non-episodic) problems.☆19Updated 3 weeks ago
- ☆25Updated 4 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆89Updated last year
- Official implementation of the paper "Chain-of-Experts: When LLMs Meet Complex Operation Research Problems"☆103Updated 6 months ago
- TextStarCraft2,a pure language env which support llms play starcraft2☆287Updated 4 months ago
- Unified Reinforcement Learning Framework☆770Updated last year
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆479Updated 11 months ago