opendilab / SO2
[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
☆288Updated 6 months ago
Alternatives and similar repositories for SO2:
Users that are interested in SO2 are comparing it to the libraries listed below
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆52Updated last year
- CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)☆46Updated last year
- MiniWoB++: a web interaction benchmark for reinforcement learning☆11Updated last year
- Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).☆467Updated last year
- Long-Term Evolution Project of Reinforcement Learning☆470Updated 2 weeks ago
- GNN&GBDT-Guided Fast Optimizing Framework for Large-scale Integer Programming(Ye et al., ICML 2023): https://openreview.net/pdf?id=tX7aj…☆307Updated last year
- PyTorch Sphinx Theme☆53Updated 6 months ago
- Decision Intelligence Adventure for Beginners☆72Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆108Updated last month
- A simple toolkit package for opendilab☆116Updated last year
- Building open-ended embodied agent in battle royale FPS game☆36Updated 11 months ago
- ☆172Updated 2 years ago
- 1024 + 深度强化学习(Deep Reinforcement Learning + 1024 Game/ 2048 Game)☆116Updated 5 months ago
- Let DI-treetensor help you simplify the structure processing!(树形运算一不小心就逻辑混乱?DI-treetensor快速帮你搞定)☆206Updated 2 months ago
- ☆167Updated 2 months ago
- ☆131Updated 2 months ago
- The first decision intelligence platform covering the most complete algorithms in academia and industry☆20Updated 2 years ago
- OpenDILab RL HPC OP Lib, including CUDA and Triton kernel☆225Updated 6 months ago
- DI-engine docs (Chinese and English)☆290Updated last month
- OpenDILab RL Object Store☆175Updated 2 years ago
- Still struggling with the high threshold or looking for the appropriate baseline? Come here and new starters can also play with your own …☆182Updated last year
- ☆210Updated last month
- OpenDILab RL Kubernetes Custom Resource and Operator Lib☆241Updated 2 years ago
- Decision Intelligence platform for Biological Sequence Searching☆113Updated 2 years ago
- Decision Intelligence platform for Traffic Crossing Signal Control☆234Updated last year
- if web site (spa) updated,we can discover☆207Updated 10 months ago
- mongo lambda query for spring boot plugin☆295Updated 8 months ago
- [AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".☆218Updated 2 years ago
- Competition work of qiniu 1024 code marathon☆268Updated 9 months ago