opendilab / SO2
[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
☆289Updated 9 months ago
Alternatives and similar repositories for SO2:
Users who are interested in SO2 are comparing it to the libraries listed below.
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆52Updated last year
- CodeMorpheus: Generate code self-portraits with one click (decision AI + generative AI)☆46Updated last year
- MiniWoB++: a web interaction benchmark for reinforcement learning☆11Updated 2 years ago
- Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).☆466Updated last year
- GNN&GBDT-Guided Fast Optimizing Framework for Large-scale Integer Programming (Ye et al., ICML 2023): https://openreview.net/pdf?id=tX7aj…☆308Updated last year
- Long-Term Evolution Project of Reinforcement Learning☆469Updated 2 months ago
- Decision Intelligence Adventure for Beginners☆72Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆123Updated last month
- A simple toolkit package for opendilab☆116Updated last year
- Building open-ended embodied agent in battle royale FPS game☆37Updated last year
- [AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".☆220Updated 2 years ago
- PyTorch Sphinx Theme☆53Updated 8 months ago
- ☆145Updated 2 months ago
- DI-engine docs (Chinese and English)☆296Updated 3 weeks ago
- ☆172Updated 2 years ago
- ☆234Updated 4 months ago
- 1024 + deep reinforcement learning (Deep Reinforcement Learning + 1024 Game / 2048 Game)☆117Updated 8 months ago
- Detects when a website (SPA) has been updated☆204Updated last year
- ☆208Updated last week
- RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random ne…☆387Updated 5 months ago
- OpenDILab RL Kubernetes Custom Resource and Operator Lib☆241Updated 2 years ago
- ☆165Updated last year
- The first decision intelligence platform covering the most complete algorithms in academia and industry☆20Updated 2 years ago
- Still struggling with the high threshold or looking for the appropriate baseline? Come here and new starters can also play with your own …☆181Updated 2 years ago
- OpenDILab RL Object Store☆175Updated 2 years ago
- A session SDK for the OpenAI open API, modeled on the MyBatis framework's SQL session factory; configure it and it works out of the box☆161Updated last year
- Let DI-treetensor help you simplify structure processing! (Tree operations getting tangled? DI-treetensor sorts them out quickly)☆206Updated 5 months ago
- Control Google Slides With Just Hand Gestures!☆204Updated last year
- A curated list of awesome exploration RL resources (continually updated)☆458Updated last month
- OpenDILab RL HPC OP Lib, including CUDA and Triton kernel☆225Updated 8 months ago