opendilab / SO2Links
[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
☆291Updated last year
Alternatives and similar repositories for SO2
Users that are interested in SO2 are comparing it to the libraries listed below
Sorting:
- Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).☆461Updated last year
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆52Updated last year
- Long-Term Evolution Project of Reinforcement Learning☆475Updated 2 months ago
- MiniWoB++: a web interaction benchmark for reinforcement learning☆11Updated 2 years ago
- CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)☆46Updated last year
- GNN&GBDT-Guided Fast Optimizing Framework for Large-scale Integer Programming(Ye et al., ICML 2023): https://openreview.net/pdf?id=tX7aj…☆306Updated last year
- PyTorch Sphinx Theme☆54Updated 11 months ago
- [AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".☆221Updated 2 years ago
- DI-engine docs (Chinese and English)☆302Updated 3 months ago
- A simple toolkit package for opendilab☆116Updated last year
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆139Updated 4 months ago
- ☆172Updated 2 years ago
- Decision Intelligence Adventure for Beginners☆72Updated 2 years ago
- ☆207Updated 3 months ago
- 1024 + 深度强化学习(Deep Reinforcement Learning + 1024 Game/ 2048 Game)☆118Updated 11 months ago
- Let DI-treetensor help you simplify the structure processing!(树形运算一不小心就逻辑混乱?DI-treetensor快速帮你搞定)☆207Updated 8 months ago
- if web site (spa) updated,we can discover☆203Updated last year
- Building open-ended embodied agent in battle royale FPS game☆38Updated last year
- ☆250Updated 7 months ago
- Decision Intelligence platform for Biological Sequence Searching☆115Updated 2 years ago
- Decision Intelligence platform for Traffic Crossing Signal Control☆238Updated 2 years ago
- RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random ne …☆404Updated 2 months ago
- The first decision intelligence platform covering the most complete algorithms in academia and industry☆20Updated 2 years ago
- Competition work of qiniu 1024 code marathon☆263Updated last year
- OpenDILab RL Object Store☆175Updated 3 years ago
- mongo lambda query for spring boot plugin☆288Updated last year
- OpenDILab RL Kubernetes Custom Resource and Operator Lib☆241Updated 2 years ago
- 从词表到微调这就是你所需的一切☆249Updated last year
- A curated list of awesome exploration RL resources (continually updated)☆492Updated 4 months ago
- SMP是一个「轻量级」流媒体平台。 它支持按流名称进行流或拉流,目前支持三种协议:StreamInfo、CodecInfo 和 Packet,分别管理流或拉的启动、编解码器信息和音频/视频帧。在网络方面,它利用基于boost::Asio的异步IO,允许并发处理大量流和拉取请…☆149Updated last year