opendilab / SO2
[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
☆288Updated 7 months ago
Alternatives and similar repositories for SO2:
Users that are interested in SO2 are comparing it to the libraries listed below
- MiniWoB++: a web interaction benchmark for reinforcement learning☆11Updated last year
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆52Updated last year
- Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).☆467Updated last year
- CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)☆46Updated last year
- A simple toolkit package for opendilab☆116Updated last year
- GNN&GBDT-Guided Fast Optimizing Framework for Large-scale Integer Programming(Ye et al., ICML 2023): https://openreview.net/pdf?id=tX7aj…☆306Updated last year
- PyTorch Sphinx Theme☆53Updated 7 months ago
- Long-Term Evolution Project of Reinforcement Learning☆470Updated last month
- Decision Intelligence Adventure for Beginners☆72Updated 2 years ago
- 1024 + 深度强化学习(Deep Reinforcement Learning + 1024 Game/ 2048 Game)☆117Updated 6 months ago
- [AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".☆218Updated 2 years ago
- Let DI-treetensor help you simplify the structure processing!(树形运算一不小心就逻辑混乱?DI-treetensor快速帮你搞定)☆206Updated 3 months ago
- ☆172Updated 2 years ago
- OpenDILab RL Object Store☆175Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆112Updated last month
- if web site (spa) updated,we can discover☆205Updated 10 months ago
- DI-engine docs (Chinese and English)☆292Updated last month
- ☆209Updated 2 months ago
- Building open-ended embodied agent in battle royale FPS game☆37Updated last year
- A python Web Framework☆197Updated 6 months ago
- Decision Intelligence platform for Biological Sequence Searching☆113Updated 2 years ago
- The first decision intelligence platform covering the most complete algorithms in academia and industry☆20Updated 2 years ago
- ☆187Updated 3 months ago
- Still struggling with the high threshold or looking for the appropriate baseline? Come here and new starters can also play with your own …☆182Updated last year
- mongo lambda query for spring boot plugin☆293Updated 9 months ago
- ☆166Updated 10 months ago
- OpenDILab RL HPC OP Lib, including CUDA and Triton kernel☆225Updated 7 months ago
- Competition work of qiniu 1024 code marathon☆267Updated 10 months ago
- go web 框架 golang web框架 轻量级高并发 go web framework☆641Updated last week
- ☆137Updated 3 weeks ago