opendilab / SO2
[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
☆285Updated 2 months ago
Related projects: ⓘ
- Auxiliary code for pulling, loading reinforcement learning models based on DI-engine from the Huggingface Hub, or pushing them onto Huggi…☆46Updated 9 months ago
- MiniWoB++: a web interaction benchmark for reinforcement learning☆12Updated last year
- CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)☆45Updated 8 months ago
- Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).☆479Updated last year
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆53Updated last week
- Long-Term Evolution Project of Reinforcement Learning☆464Updated 3 weeks ago
- A simple toolkit package for opendilab☆113Updated 8 months ago
- GNN&GBDT-Guided Fast Optimizing Framework for Large-scale Integer Programming(Ye et al., ICML 2023): https://openreview.net/pdf?id=tX7aj…☆311Updated last year
- PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements. (e.g. MBTI Measurement Agent)☆105Updated last month
- Decision Intelligence Adventure for Beginners☆68Updated last year
- A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)☆38Updated last month
- PyTorch Sphinx Theme☆50Updated 2 months ago
- 1024 + 深度强化学习(Deep Reinforcement Learning + 1024 Game/ 2048 Game)☆109Updated last month
- ☆173Updated last year
- DI-engine docs (Chinese and English)☆279Updated 2 months ago
- The first decision intelligence platform covering the most complete algorithms in academia and industry☆19Updated last year
- Let DI-treetensor help you simplify the structure processing!(树形运算一不小心就逻辑混乱?DI-treetensor快速帮你搞定)☆202Updated last year
- OpenDILab RL Object Store☆177Updated 2 years ago
- A curated list of awesome exploration RL resources (continually updated)☆369Updated 3 weeks ago
- Decision Intelligence platform for Biological Sequence Searching☆111Updated last year
- ☆213Updated last month
- RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random ne…☆350Updated 3 weeks ago
- [AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".☆212Updated last year
- Here are the most awesome tree structure computing solutions, make your life easier. (这里有目前性能最优的树形结构计算解决方案)☆229Updated last week
- OpenDILab RL HPC OP Lib, including CUDA and Triton kernel☆219Updated 2 months ago
- Decision Intelligence platform for Traffic Crossing Signal Control☆230Updated last year
- Still struggling with the high threshold or looking for the appropriate baseline? Come here and new starters can also play with your own …☆185Updated last year
- ☆172Updated 6 months ago
- 从词表到微调这就是你所需的一切☆256Updated 9 months ago
- Decision Intelligence for digging best parameters in target environment.☆90Updated last year