opendilab / SO2
[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
☆290Updated 10 months ago
Alternatives and similar repositories for SO2:
Users that are interested in SO2 are comparing it to the libraries listed below
- CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)☆46Updated last year
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆52Updated last year
- Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).☆464Updated last year
- MiniWoB++: a web interaction benchmark for reinforcement learning☆11Updated 2 years ago
- Long-Term Evolution Project of Reinforcement Learning☆472Updated 2 weeks ago
- GNN&GBDT-Guided Fast Optimizing Framework for Large-scale Integer Programming(Ye et al., ICML 2023): https://openreview.net/pdf?id=tX7aj…☆308Updated last year
- Decision Intelligence Adventure for Beginners☆72Updated 2 years ago
- A simple toolkit package for opendilab☆116Updated last year
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆125Updated 2 months ago
- PyTorch Sphinx Theme☆54Updated 9 months ago
- ☆208Updated last month
- if web site (spa) updated,we can discover☆204Updated last year
- ☆172Updated 2 years ago
- ☆165Updated last year
- [AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".☆222Updated 2 years ago
- Building open-ended embodied agent in battle royale FPS game☆37Updated last year
- Competition work of qiniu 1024 code marathon☆265Updated last year
- SMP是一个「轻量级」流媒体平台。 它支持按流名称进行流或拉流,目前支持三种协议:StreamInfo、CodecInfo 和 Packet,分别管理流或拉的启动、编解码器信息和音频/视频帧。在网络方面,它利用基于boost::Asio的异步IO,允许并发处理大量流和拉取请…☆149Updated last year
- mongo lambda query for spring boot plugin☆290Updated 11 months ago
- Let DI-treetensor help you simplify the structure processing!(树形运算一不小心就逻辑混乱?DI-treetensor快速帮你搞定)☆207Updated 6 months ago
- 1024 + 深度 强化学习(Deep Reinforcement Learning + 1024 Game/ 2048 Game)☆118Updated 9 months ago
- 从词表到微调这就是你所需的一切☆249Updated last year
- OpenDILab RL Object Store☆175Updated 3 years ago
- 对API接口中的CRUD场景进行了丰富的封装,大幅减少开发人员在这方面的编码工作。☆132Updated last year
- Decision Intelligence platform for Biological Sequence Searching☆115Updated 2 years ago
- The first decision intelligence platform covering the most complete algorithms in academia and industry☆20Updated 2 years ago
- OpenAI 开放API的 会话 SDK,参考mybatis框架的sql会话工厂模型实现,只需配置,开箱即用~☆161Updated last year
- 清华智谱 ChatGLM 会话 SDK,参考mybatis框架的sql会话工厂模型实现。只需配置,开箱即用~☆153Updated last year
- DI-engine docs (Chinese and English)☆297Updated last month
- 基于 React + Spring Boot + Picocli + 对象存储的代码生成器共享平台,又分为 3 个循序渐进的子项目:基于命令行的本地代码生成器 + 代码生成器制作工具 + 在线代码生成器平台。实践 Java 命令行应用开发、FreeMarker 模板引擎、多…☆166Updated last year